Running 86 Unlocking On-Policy Distillation for Any Model Family 📝 86 Visualize on-policy distillation for any model family
Running on Zero 31 Gpt2 Multiplication Predictor 📈 31 Multiply large numbers using different reasoning methods
Running 593 Scaling test-time compute 📈 593 Run advanced search strategies to boost LLM problem solving