view article Article Enabling Long Context Training with Sequence Parallelism in Axolotl By axolotl-ai-co and 1 other • Apr 4 • 8
Running 2.66k 2.66k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters