Running 2.62k 2.62k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Running 564 564 Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute
view post Post 3848 just landed at Hugging Face Hub: community-led computer vision course 📖🤍 learn from fundamentals to details of the bleeding edge vision transformers! 1 reply · 🔥 16 16 ❤️ 13 13 👀 4 4 + Reply
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published Apr 22, 2024 • 257
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing Paper • 2404.12253 • Published Apr 18, 2024 • 56
The Unreasonable Ineffectiveness of the Deeper Layers Paper • 2403.17887 • Published Mar 26, 2024 • 81