MemFly: On-the-Fly Memory Optimization via Information Bottleneck Paper • 2602.07885 • Published 7 days ago • 7 • 3
Stemphonic: All-at-once Flexible Multi-stem Music Generation Paper • 2602.09891 • Published 5 days ago • 2 • 3
P-GenRM: Personalized Generative Reward Model with Test-time User-based Scaling Paper • 2602.12116 • Published 3 days ago • 3 • 3
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift Paper • 1502.03167 • Published Feb 11, 2015 • 2 • 1
MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models Paper • 2602.10934 • Published 4 days ago • 47 • 4
The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies Paper • 2602.09877 • Published 5 days ago • 182 • 9
A Survey of LLM-based Deep Search Agents: Paradigm, Optimization, Evaluation, and Challenges Paper • 2508.05668 • Published Aug 3, 2025 • 1 • 1
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters Paper • 2602.10604 • Published 4 days ago • 173 • 5
Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making Paper • 2602.06570 • Published 9 days ago • 59 • 3
Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers Paper • 2602.06079 • Published 11 days ago • 18 • 3
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models Paper • 2602.03392 • Published 12 days ago • 52 • 7
CoPE: Clipped RoPE as A Scalable Free Lunch for Long Context LLMs Paper • 2602.05258 • Published 10 days ago • 7 • 4
SocialVeil: Probing Social Intelligence of Language Agents under Communication Barriers Paper • 2602.05115 • Published 11 days ago • 18 • 9
Privileged Information Distillation for Language Models Paper • 2602.04942 • Published 11 days ago • 25 • 4
DFlash: Block Diffusion for Flash Speculative Decoding Paper • 2602.06036 • Published 10 days ago • 41 • 2
Learning Rate Matters: Vanilla LoRA May Suffice for LLM Fine-tuning Paper • 2602.04998 • Published 11 days ago • 6 • 5
MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents Paper • 2602.02474 • Published 13 days ago • 53 • 4
Agent Primitives: Reusable Latent Building Blocks for Multi-Agent Systems Paper • 2602.03695 • Published 12 days ago • 1 • 1
Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening Paper • 2602.05386 • Published 10 days ago • 69 • 4