Foundation Text-Generation Models Below 360M Parameters Collection Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. • 36 items • Updated 21 days ago • 30
Trained Models Collection They may be small, but they're training like giants! • 8 items • Updated Dec 3, 2024 • 20
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 229