view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 • 186
Running 1.98k 1.98k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
HuggingFaceFW/fineweb-edu-classifier Text Classification • Updated Nov 17, 2024 • 32.7k • • 169