SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization Paper • 2604.02268 • Published 4 days ago • 83
Reasoning Shift: How Context Silently Shortens LLM Reasoning Paper • 2604.01161 • Published 4 days ago • 27
ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners? Paper • 2603.25823 • Published 10 days ago • 42
Embarrassingly Simple Self-Distillation Improves Code Generation Paper • 2604.01193 • Published 4 days ago • 24
LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model Paper • 2604.02097 • Published 4 days ago • 27
Conservative Offline Robot Policy Learning via Posterior-Transition Reweighting Paper • 2603.16542 • Published 19 days ago • 11
Alignment Makes Language Models Normative, Not Descriptive Paper • 2603.17218 • Published 19 days ago • 46
V_1: Unifying Generation and Self-Verification for Parallel Reasoners Paper • 2603.04304 • Published Mar 4 • 14
T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning Paper • 2603.03790 • Published Mar 4 • 121
Lost in the Noise: How Reasoning Models Fail with Contextual Distractors Paper • 2601.07226 • Published Jan 12 • 33