Scale AI
company
Verified
AI & ML interests
None defined yet.
Recent Activity
Papers
Agentic Rubrics as Contextual Verifiers for SWE Agents
ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents
datasets
21
ScaleAI/audiomc
Viewer
•
Updated
•
452
•
1.18k
•
8
ScaleAI/lhaw
Viewer
•
Updated
•
285
•
19
•
2
ScaleAI/SWE-bench_Pro
Viewer
•
Updated
•
731
•
35.6k
•
50
ScaleAI/SciPredict
Viewer
•
Updated
•
405
•
50
•
1
ScaleAI/PRBench
Viewer
•
Updated
•
1.65k
•
691
•
6
ScaleAI/MCP-Atlas
Viewer
•
Updated
•
500
•
532
•
7
ScaleAI/VisualToolBench
Viewer
•
Updated
•
1.2k
•
97
•
3
ScaleAI/dummy_mcp
Viewer
•
Updated
•
16
•
20
ScaleAI/researchrubrics
Viewer
•
Updated
•
101
•
185
•
17
ScaleAI/swe-oec-claude-expert
Viewer
•
Updated
•
1.27k
•
66
•
1