Papers
updated
The Art of Scaling Reinforcement Learning Compute for LLMs
Paper
• 2510.13786
• Published
• 32
Attention Is All You Need for KV Cache in Diffusion LLMs
Paper
• 2510.14973
• Published
• 42
Paper
• 2510.13998
• Published
• 59
GigaBrain-0: A World Model-Powered Vision-Language-Action Model
Paper
• 2510.19430
• Published
• 52
Every Question Has Its Own Value: Reinforcement Learning with Explicit
Human Values
Paper
• 2510.20187
• Published
• 19
LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts
Paper
• 2510.19363
• Published
• 62
Qwen3-Omni Technical Report
Paper
• 2509.17765
• Published
• 149
T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground
Paper
• 2512.10430
• Published
• 116
Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed
Paper
• 2512.14067
• Published
• 16
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers
Paper
• 2512.17351
• Published
• 28
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper
• 2512.16676
• Published
• 219
Reinforcement Learning for Self-Improving Agent with Skill Library
Paper
• 2512.17102
• Published
• 36
mHC: Manifold-Constrained Hyper-Connections
Paper
• 2512.24880
• Published
• 312
TransMLA: Multi-head Latent Attention Is All You Need
Paper
• 2502.07864
• Published
• 57
Endless Terminals: Scaling RL Environments for Terminal Agents
Paper
• 2601.16443
• Published
• 18
gpt-oss-120b & gpt-oss-20b Model Card
Paper
• 2508.10925
• Published
• 15