Scale AI
company
Verified
AI & ML interests
None defined yet.
Recent Activity
Papers
Agentic Rubrics as Contextual Verifiers for SWE Agents
ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents
datasets
21
ScaleAI/lhaw
Viewer
•
Updated
•
285
•
11
ScaleAI/SWE-bench_Pro
Viewer
•
Updated
•
731
•
23.7k
•
49
ScaleAI/audiomc
Viewer
•
Updated
•
452
•
864
•
6
ScaleAI/SciPredict
Viewer
•
Updated
•
405
•
87
•
1
ScaleAI/PRBench
Viewer
•
Updated
•
1.65k
•
641
•
6
ScaleAI/MCP-Atlas
Viewer
•
Updated
•
500
•
611
•
7
ScaleAI/VisualToolBench
Viewer
•
Updated
•
1.2k
•
88
•
3
ScaleAI/dummy_mcp
Viewer
•
Updated
•
16
•
20
ScaleAI/researchrubrics
Viewer
•
Updated
•
101
•
178
•
17
ScaleAI/swe-oec-claude-expert
Viewer
•
Updated
•
1.27k
•
62
•
1