Running 3.72k The Ultra-Scale Playbook π 3.72k The ultimate guide to training LLM on large GPU Clusters
MaziyarPanahi/MixTAO-7Bx2-MoE-Instruct-v7.0-GGUF Text Generation β’ 13B β’ Updated Feb 4, 2024 β’ 52 β’ 9