UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience Paper • 2603.24533 • Published 5 days ago • 41 • 4
RealMaster: Lifting Rendered Scenes into Photorealistic Video Paper • 2603.23462 • Published 6 days ago • 29 • 5
daVinci-Env: Open SWE Environment Synthesis at Scale Paper • 2603.13023 • Published 17 days ago • 30 • 3
CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR Paper • 2603.10101 • Published 20 days ago • 5 • 2
Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration? Paper • 2603.03202 • Published 27 days ago • 17 • 2
veScale-FSDP: Flexible and High-Performance FSDP at Scale Paper • 2602.22437 • Published Feb 25 • 7 • 2
On Data Engineering for Scaling LLM Terminal Capabilities Paper • 2602.21193 • Published Feb 24 • 101 • 4
UniWeTok: An Unified Binary Tokenizer with Codebook Size $\mathit{2^{128}}$ for Unified Multimodal Large Language Model Paper • 2602.14178 • Published Feb 15 • 14 • 2
Olaf-World: Orienting Latent Actions for Video World Modeling Paper • 2602.10104 • Published Feb 10 • 27 • 2
SAGE: Scalable Agentic 3D Scene Generation for Embodied AI Paper • 2602.10116 • Published Feb 10 • 12 • 2
DIFFA-2: A Practical Diffusion Large Language Model for General Audio Understanding Paper • 2601.23161 • Published Jan 30 • 10 • 3
OmegaUse: Building a General-Purpose GUI Agent for Autonomous Task Execution Paper • 2601.20380 • Published Jan 28 • 9 • 2