Running 2.8k 2.8k The Ultra-Scale Playbook š The ultimate guide to training LLM on large GPU Clusters
Running 573 573 Scaling test-time compute š Enhance math problem solving by scaling test-time compute
HuggingFaceTB/SmolLM-135M-Instruct Text Generation ⢠0.1B ⢠Updated Sep 4, 2024 ⢠12.6k ⢠119