nvidia/Nemotron-Research-Reasoning-Qwen-1.5B Text Generation • 2B • Updated 26 days ago • 11.9k • • 170
Reward Bench 2 Collection Datasets, spaces, and models for Reward Bench 2 benchmark and paper! • 11 items • Updated 28 days ago • 12
SynLogic Collection Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond • 5 items • Updated 28 days ago • 11
Holo1 Collection Vision-Language Action Model for use in Surfer-H web navigation agent • 6 items • Updated 20 days ago • 47
MiniCPM4 Collection MiniCPM4: Ultra-Efficient LLMs on End Devices • 22 items • Updated 9 days ago • 64
RoboBrain2.0 Collection RoboBrain 2.0: See Better. Think Harder. Do Smarter. • 1 item • Updated 26 days ago • 8