open-r1 's Collections

Step 1: Reproducing DeepSeek's Distilled Models

Code for training and evaluation: https://github.com/huggingface/open-r1