Rewards as Labels: Revisiting RLVR from a Classification Perspective Paper • 2602.05630 • Published Feb 5 • 3
view article Article Introducing smolagents: simple agents that write actions in code. +1 Dec 31, 2024 • 1.19k
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B Text Generation • 31B • Updated Oct 10, 2025 • 23.3k • 808
Running on CPU Upgrade 599 GAIA Leaderboard 🦾 599 Submit your model answers to GAIA benchmark and view leaderboard