All Roads Lead to Rome: Incentivizing Divergent Thinking in Vision-Language Models Paper • 2604.00479 • Published 2 days ago • 14
Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations Paper • 2510.23607 • Published Oct 27, 2025 • 181
Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model Paper • 2510.12276 • Published Oct 14, 2025 • 149