Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics Paper • 2512.12602 • Published Dec 14, 2025 • 44
NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards Paper • 2511.14659 • Published Nov 18, 2025 • 13
Training Vision-Language Process Reward Models for Test-Time Scaling in Multimodal Reasoning: Key Insights and Lessons Learned Paper • 2509.23250 • Published Sep 27, 2025 • 6
Error Typing for Smarter Rewards: Improving Process Reward Models with Error-Aware Hierarchical Supervision Paper • 2505.19706 • Published May 26, 2025 • 3
Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning Paper • 2412.11974 • Published Dec 16, 2024 • 10