![](https://cdn-avatars.huggingface.co/v1/production/uploads/66f889e35144a8d0c68b8078/8YIWauvyekvXlIliPAbdU.jpeg)
sthenno-com/miscii-14b-1225
Text Generation
•
Updated
•
434
•
23
datatrove
for all things web-scale data preparation: https://github.com/huggingface/datatrovenanotron
for lightweight 4D parallelism LLM training: https://github.com/huggingface/nanotronlighteval
for in-training fast parallel LLM evaluations: https://github.com/huggingface/lighteval