Running 2.47k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Running on Zero 463 Chat with DeepSeek-VL2-small 🌍 Generate responses using images and text input
Running 916 FineWeb: decanting the web for the finest text data at scale 🍷 Generate high-quality web text data for LLM training
Foundation AI Papers Collection Curated List of Must-Reads on LLM reasoning at Temus AI team • 135 items • Updated Jun 15, 2024 • 32