SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization Paper • 2604.02268 • Published 5 days ago • 86
Reasoning Shift: How Context Silently Shortens LLM Reasoning Paper • 2604.01161 • Published 6 days ago • 28
ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners? Paper • 2603.25823 • Published 12 days ago • 42
Embarrassingly Simple Self-Distillation Improves Code Generation Paper • 2604.01193 • Published 6 days ago • 28
LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model Paper • 2604.02097 • Published 5 days ago • 28
Conservative Offline Robot Policy Learning via Posterior-Transition Reweighting Paper • 2603.16542 • Published 21 days ago • 11
Alignment Makes Language Models Normative, Not Descriptive Paper • 2603.17218 • Published 20 days ago • 46
V_1: Unifying Generation and Self-Verification for Parallel Reasoners Paper • 2603.04304 • Published Mar 4 • 14
T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning Paper • 2603.03790 • Published Mar 4 • 121
Lost in the Noise: How Reasoning Models Fail with Contextual Distractors Paper • 2601.07226 • Published Jan 12 • 33
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers Paper • 2512.17351 • Published Dec 19, 2025 • 28