Papers
updated
WorldVLA: Towards Autoregressive Action World Model
Paper
• 2506.21539
• Published • 40
Fast and Simplex: 2-Simplicial Attention in Triton
Paper
• 2507.02754
• Published • 25
IntFold: A Controllable Foundation Model for General and Specialized
Biomolecular Structure Prediction
Paper
• 2507.02025
• Published • 35
Thinking Beyond Tokens: From Brain-Inspired Intelligence to Cognitive
Foundations for Artificial General Intelligence and its Societal Impact
Paper
• 2507.00951
• Published • 24
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable
Reinforcement Learning
Paper
• 2507.01006
• Published • 253
Does Math Reasoning Improve General LLM Capabilities? Understanding
Transferability of LLM Reasoning
Paper
• 2507.00432
• Published • 79
CriticLean: Critic-Guided Reinforcement Learning for Mathematical
Formalization
Paper
• 2507.06181
• Published • 45
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via
Context-Aware Multi-Stage Policy Optimization
Paper
• 2507.14683
• Published • 136
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm
Bridging Foundation Models and Lifelong Agentic Systems
Paper
• 2508.07407
• Published • 99
MoBE: Mixture-of-Basis-Experts for Compressing MoE-based LLMs
Paper
• 2508.05257
• Published • 13
Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts
Paper
• 2508.07785
• Published • 29
rStar2-Agent: Agentic Reasoning Technical Report
Paper
• 2508.20722
• Published • 118
Think in Games: Learning to Reason in Games via Reinforcement Learning
with Large Language Models
Paper
• 2508.21365
• Published • 29
Less is More: Recursive Reasoning with Tiny Networks
Paper
• 2510.04871
• Published • 513
Diffusion Transformers with Representation Autoencoders
Paper
• 2510.11690
• Published • 170
Agent Learning via Early Experience
Paper
• 2510.08558
• Published • 276
Demystifying Reinforcement Learning in Agentic Reasoning
Paper
• 2510.11701
• Published • 33
ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning
and Online Reinforcement Learning
Paper
• 2510.12693
• Published • 28
Information Gain-based Policy Optimization: A Simple and Effective
Approach for Multi-Turn LLM Agents
Paper
• 2510.14967
• Published • 34
Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale
Thinking Model
Paper
• 2510.18855
• Published • 73
LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts
Paper
• 2510.19363
• Published • 63
MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning
Paper
• 2511.06805
• Published • 13
DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs
Paper
• 2601.03559
• Published • 14
Self-Hinting Language Models Enhance Reinforcement Learning
Paper
• 2602.03143
• Published • 31
RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System
Paper
• 2602.02488
• Published • 36
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning
Paper
• 2602.04634
• Published • 99
Memory Intelligence Agent
Paper
• 2604.04503
• Published • 51