SketchVLM: Vision language models can annotate images to explain thoughts and guide users Paper • 2604.22875 • Published 6 days ago • 22
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation Paper • 2604.24764 • Published 2 days ago • 104
ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning Paper • 2604.24300 • Published 2 days ago • 57
Exploring Spatial Intelligence from a Generative Perspective Paper • 2604.20570 • Published 7 days ago • 21
(1D) Ordered Tokens Enable Efficient Test-Time Search Paper • 2604.15453 • Published 13 days ago • 19
MultiWorld: Scalable Multi-Agent Multi-View Video World Models Paper • 2604.18564 • Published 9 days ago • 43
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation Paper • 2604.18486 • Published 9 days ago • 88
Learning Adaptive Reasoning Paths for Efficient Visual Reasoning Paper • 2604.14568 • Published 13 days ago • 8
SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments Paper • 2604.14144 • Published 14 days ago • 62
You Only Judge Once: Multi-response Reward Modeling in a Single Forward Pass Paper • 2604.10966 • Published 16 days ago • 11
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation Paper • 2604.10098 • Published 18 days ago • 77
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver Paper • 2604.08377 • Published 20 days ago • 287
VisionFoundry: Teaching VLMs Visual Perception with Synthetic Images Paper • 2604.09531 • Published 19 days ago • 8
WildDet3D: Scaling Promptable 3D Detection in the Wild Paper • 2604.08626 • Published 20 days ago • 242
OpenSpatial: A Principled Data Engine for Empowering Spatial Intelligence Paper • 2604.07296 • Published 21 days ago • 39
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published 21 days ago • 323
HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents Paper • 2604.07430 • Published 21 days ago • 187