CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published 3 days ago • 228
Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR Paper • 2602.05261 • Published Feb 5 • 52
Perceptual-Evidence Anchored Reinforced Learning for Multimodal Reasoning Paper • 2511.18437 • Published Nov 23, 2025 • 1
DeepSketcher: Internalizing Visual Manipulation for Multimodal Reasoning Paper • 2509.25866 • Published Sep 30, 2025 • 2