Scale AI

company

Verified

https://scale.com/

scale_ai

Recent Activity

agosai new activity 1 day ago

ScaleAI/audiomc:Incorrect transcript for klvkcs0dggmesj0i, user turn 5

agosai updated a dataset 1 day ago

ScaleAI/audiomc

utkarsh4430 updated a dataset 17 days ago

ScaleAI/SciPredict

Papers

Agentic Rubrics as Contextual Verifiers for SWE Agents

ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents

agosai

in ScaleAI/audiomc 1 day ago

Incorrect transcript for klvkcs0dggmesj0i, user turn 5

#1 opened 1 day ago by

agosai

updated a dataset 1 day ago

ScaleAI/audiomc

Viewer • Updated 1 day ago • 452 • 289 • 4

utkarsh4430

updated a dataset 17 days ago

ScaleAI/SciPredict

Viewer • Updated 17 days ago • 405 • 74 • 1

utkarsh4430

published a dataset 17 days ago

ScaleAI/SciPredict

Viewer • Updated 17 days ago • 405 • 74 • 1

feyzaakyurek

updated a dataset 17 days ago

ScaleAI/PRBench

Viewer • Updated 17 days ago • 1.65k • 422 • 6

taesiri

submitted a paper to Daily Papers 25 days ago

Agentic Rubrics as Contextual Verifiers for SWE Agents

Paper • 2601.04171 • Published 25 days ago • 11

bhertz

updated a dataset about 1 month ago

ScaleAI/MCP-Atlas

Viewer • Updated Dec 19, 2025 • 500 • 510 • 6

agosai

updated a collection about 2 months ago

AudioMultiChallenge

1 item • Updated Dec 18, 2025

bhertz

published a dataset about 2 months ago

ScaleAI/MCP-Atlas

Viewer • Updated Dec 19, 2025 • 500 • 510 • 6

pmannam

published a dataset about 2 months ago

ScaleAI/robotics-meerkat

Updated Dec 17, 2025 • 4

agosai

published a dataset about 2 months ago

ScaleAI/audiomc

Viewer • Updated 1 day ago • 452 • 289 • 4

utkarsh4430

updated a dataset about 2 months ago

ScaleAI/VisualToolBench

Viewer • Updated Dec 16, 2025 • 1.2k • 51 • 2

yunzhong-scale

updated a dataset about 2 months ago

ScaleAI/VisualToolBench

Viewer • Updated Dec 16, 2025 • 1.2k • 51 • 2

bhertz

updated a dataset 2 months ago

ScaleAI/dummy_mcp

Viewer • Updated Dec 2, 2025 • 16 • 12

bhertz

published a dataset 2 months ago

ScaleAI/dummy_mcp

Viewer • Updated Dec 2, 2025 • 16 • 12

manasisharma

authored 4 papers 3 months ago

BEHAVIOR-1K: A Human-Centered, Embodied AI Benchmark with 1,000 Everyday Activities and Realistic Simulation

Paper • 2403.09227 • Published Mar 14, 2024 • 1

Adaptive Guidance Accelerates Reinforcement Learning of Reasoning Models

Paper • 2506.13923 • Published Jun 16, 2025

Remote Labor Index: Measuring AI Automation of Remote Work

Paper • 2510.26787 • Published Oct 30, 2025 • 7

ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents

Paper • 2511.07685 • Published Nov 10, 2025 • 10

calvincbzhang

updated a collection 3 months ago

ResearchRubrics

1 item • Updated Nov 16, 2025