SWE-Universe: Scale Real-World Verifiable Environments to Millions Paper • 2602.02361 • Published 10 days ago • 59
Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow Paper • 2601.14243 • Published 23 days ago • 21
Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision Paper • 2601.19798 • Published 16 days ago • 42
view article Article Introducing Waypoint-1: Real-time interactive video diffusion from Overworld +3 24 days ago • 37
The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models Paper • 2601.15165 • Published 22 days ago • 71
Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning Paper • 2601.06943 • Published Jan 11 • 211
EnvScaler Collection The official datasets and models of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis" • 8 items • Updated about 1 month ago • 3
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8, 2025 • 288
💧 LFM2.5 Collection Collection of Instruct, Base, and Japanese LFM2.5-1.2B models. • 22 items • Updated 11 days ago • 85
Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Paper • 2601.02346 • Published Jan 5 • 26
TimeBill: Time-Budgeted Inference for Large Language Models Paper • 2512.21859 • Published Dec 26, 2025 • 25
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 May 21, 2025 • 251