MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models Paper • 2603.25744 • Published 19 days ago • 13
SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning Paper • 2603.22057 • Published 22 days ago • 46
Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models Paper • 2603.17051 • Published 28 days ago • 109
When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning Paper • 2603.21289 • Published 23 days ago • 35
Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought Paper • 2603.22847 • Published 21 days ago • 26
Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models Paper • 2603.24844 • Published 20 days ago • 10
Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs Paper • 2603.22446 • Published 22 days ago • 9
mSFT: Addressing Dataset Mixtures Overfiting Heterogeneously in Multi-task SFT Paper • 2603.21606 • Published 22 days ago • 39
Scalable Prompt Routing via Fine-Grained Latent Task Discovery Paper • 2603.19415 • Published 26 days ago • 7
SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning Paper • 2603.23483 • Published 21 days ago • 62
Understanding the Challenges in Iterative Generative Optimization with LLMs Paper • 2603.23994 • Published 20 days ago • 28
Learning to Commit: Generating Organic Pull Requests via Online Repository Memory Paper • 2603.26664 • Published 18 days ago • 9
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale Paper • 2603.25040 • Published 20 days ago • 130
From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents Paper • 2603.22386 • Published 22 days ago • 55
ARC-AGI-3: A New Challenge for Frontier Agentic Intelligence Paper • 2603.24621 • Published 21 days ago • 2
Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory Paper • 2604.01007 • Published 13 days ago • 31
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization Paper • 2604.02268 • Published 13 days ago • 93
Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time Paper • 2604.00917 • Published 14 days ago • 18
AgentSocialBench: Evaluating Privacy Risks in Human-Centered Agentic Social Networks Paper • 2604.01487 • Published 14 days ago • 10