UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning Paper • 2509.02544 • Published Sep 2, 2025 • 127
view article Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! +1 Jun 6, 2025 • 56
Running 23 Online-Mind2Web Leaderboard 🌐 23 Explore Mind2Web agent performance with interactive tables and charts
Running on CPU Upgrade 599 GAIA Leaderboard 🦾 599 Submit your model answers to GAIA benchmark and view leaderboard
Running 241 MedGemma - Radiology Explainer Demo 🩺 241 Radiology Image & Report Explainer Demo. Built with MedGemma
Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents Paper • 2504.00906 • Published Apr 1, 2025 • 27