Mathematics Benchmark Datasets knoveleng/AMC-23 Viewer • Updated Mar 14 • 40 • 3.88k • 1 knoveleng/Minerva-Math Viewer • Updated Mar 14 • 272 • 3.72k • 1 knoveleng/OlympiadBench Viewer • Updated Mar 14 • 675 • 3.52k • 1
Open-RS Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t" Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20 • 51 knoveleng/OpenRS-GRPO Text Generation • 2B • Updated Mar 21 • 53 • 5 knoveleng/Open-RS1 Text Generation • 2B • Updated Mar 24 • 813 • 4 knoveleng/Open-RS2 Text Generation • 2B • Updated Mar 24 • 961 • 1
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20 • 51
Mathematics Benchmark Datasets knoveleng/AMC-23 Viewer • Updated Mar 14 • 40 • 3.88k • 1 knoveleng/Minerva-Math Viewer • Updated Mar 14 • 272 • 3.72k • 1 knoveleng/OlympiadBench Viewer • Updated Mar 14 • 675 • 3.52k • 1
Open-RS Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t" Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20 • 51 knoveleng/OpenRS-GRPO Text Generation • 2B • Updated Mar 21 • 53 • 5 knoveleng/Open-RS1 Text Generation • 2B • Updated Mar 24 • 813 • 4 knoveleng/Open-RS2 Text Generation • 2B • Updated Mar 24 • 961 • 1
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20 • 51