Article Fit More and Train Faster With ZeRO via DeepSpeed and FairScale Jan 19, 2021 • 4
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25 • 86
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models Paper • 2308.16149 • Published Aug 30, 2023 • 25