RoboOmni Collection Proactive Robot Manipulation in Omni-modal Context • 9 items • Updated 9 days ago • 11
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm Paper • 2511.04570 • Published Nov 6, 2025 • 243
SRPO Collection Official Collections for SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models, including SFT and RL models. • 5 items • Updated Feb 11