Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • 17 days ago • 576
NextCoder Collection NextCoder family of code-editing LMs developed with Selective Knowledge Transfer and its training data. • 6 items • Updated 15 days ago • 68
Article The Transformers Library: standardizing model definitions By lysandre and 3 others • May 15 • 116
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The models preserve quality similar to half precision while using 3x less memory • 15 items • Updated 14 days ago • 206
Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub By jsulz and 3 others • Feb 12 • 69
llama.vim Collection Recommended models for the llama.vim and llama.vscode plugins • 9 items • Updated May 14 • 43
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 40 items • Updated about 1 month ago • 118
Llama 3.2 3B & 1B GGUF Quants Collection Llama.cpp compatible quants for Llama 3.2 3B and 1B Instruct models. • 4 items • Updated Sep 26, 2024 • 46