A Self-Evolving Framework for Efficient Terminal Agents via Observational Context Compression Paper • 2604.19572 • Published 4 days ago • 16
Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges Paper • 2604.13602 • Published 10 days ago • 20
Elucidating the SNR-t Bias of Diffusion Probabilistic Models Paper • 2604.16044 • Published 8 days ago • 72
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published 17 days ago • 319
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models Paper • 2603.26164 • Published 28 days ago • 359
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation Paper • 2604.10098 • Published 14 days ago • 75
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance Paper • 2604.12627 • Published 11 days ago • 98
LightThinker++: From Reasoning Compression to Memory Management Paper • 2604.03679 • Published 21 days ago • 37
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models Paper • 2604.04707 • Published 19 days ago • 200
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 19 days ago • 110
SpikingBrain Technical Report: Spiking Brain-inspired Large Models Paper • 2509.05276 • Published Sep 5, 2025 • 5
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization Paper • 2604.02268 • Published 23 days ago • 96
ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers Paper • 2603.24414 • Published 30 days ago • 183
Dynin-Omni: Omnimodal Unified Large Diffusion Language Model Paper • 2604.00007 • Published Mar 9 • 19
Data Darwinism Part I: Unlocking the Value of Scientific Data for Pre-training Paper • 2602.07824 • Published Feb 8 • 18
LongCat-Next: Lexicalizing Modalities as Discrete Tokens Paper • 2603.27538 • Published 26 days ago • 144