Running Agents 432 Reward Bench Leaderboard 📐 432 Explore and compare model scores on RewardBench benchmarks
Sleeping Agents 420 Whisper Speaker Diarization 🎎 420 Generate speaker‑labeled transcripts from video or audio