This is a ByteLlama 101M model pretrained on the Cosmopedia v2 portion of the SmolLM corpus for 2 epochs, followed by training on a subset of OSCAR for another epoch.
- Downloads last month
- 240
Model tree for mittagessen/bytellama_oscar
Base model
mittagessen/bytellama_random