ECHO-2: A Large-Scale Distributed Rollout Framework for Cost-Efficient Reinforcement Learning Paper • 2602.02192 • Published 16 days ago • 12
MARS: Unleashing the Power of Speculative Decoding via Margin-Aware Verification Paper • 2601.15498 • Published 28 days ago
Group Pattern Selection Optimization: Let LRMs Pick the Right Pattern for Reasoning Paper • 2601.07238 • Published Jan 12
Teaching Large Reasoning Models Effective Reflection Paper • 2601.12720 • Published about 1 month ago
Code-Vision: Evaluating Multimodal LLMs Logic Understanding and Code Generation Capabilities Paper • 2502.11829 • Published Feb 17, 2025