Agentic Rubrics as Contextual Verifiers for SWE Agents Paper • 2601.04171 • Published 10 days ago • 10
Detecting and Preventing Hallucinations in Large Vision Language Models Paper • 2308.06394 • Published Aug 11, 2023
Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains Paper • 2507.17746 • Published Jul 23, 2025 • 3
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state-of-the-art open post-training recipes. • 33 items • Updated 25 days ago • 96