view article Article Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents 1 day ago • 30
\$OneMillion-Bench: How Far are Language Agents from Human Experts? Paper • 2603.07980 • Published 24 days ago • 27
ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use Paper • 2504.07981 • Published Apr 4, 2025 • 5
view article Article The First Healthcare Robotics Dataset and Foundational Physical AI Models for Healthcare Robotics 16 days ago • 22
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections Paper • 2603.12180 • Published 21 days ago • 64
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 15 items • Updated 1 day ago • 255
SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale Paper • 2602.23866 • Published Feb 27 • 88