Smol Models - a BjornMelin Collection

Smol Models

updated 15 days ago

My favorite smaller models under 10B parameters.

unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF

Text Generation • 8B • Updated Jun 16 • 176k • 303
nvidia/Llama-3.1-Nemotron-Nano-8B-v1

Text Generation • 8B • Updated May 8 • 116k • 201
deepseek-ai/DeepSeek-R1-Distill-Llama-8B

Text Generation • 8B • Updated Feb 24 • 2.68M • 796
Qwen/Qwen2.5-Coder-7B-Instruct

Text Generation • 8B • Updated Jan 12 • 756k • 536
Qwen/Qwen2.5-7B-Instruct

Text Generation • 8B • Updated Jan 12 • 12.4M • 777
VIDraft/Gemma-3-R1984-4B

Image-Text-to-Text • 4B • Updated Apr 10 • 1.24k • 18
meta-llama/Llama-3.1-8B-Instruct

Text Generation • 8B • Updated Sep 25, 2024 • 9.15M • 4.57k
meta-llama/Llama-3.2-3B-Instruct

Text Generation • 3B • Updated Oct 24, 2024 • 1.74M • 1.68k
openbmb/MiniCPM-o-2_6

Any-to-Any • 9B • Updated Jun 20 • 191k • 1.23k
microsoft/Phi-4-mini-reasoning

Text Generation • 4B • Updated May 1 • 9.75k • 202
microsoft/bitnet-b1.58-2B-4T

Text Generation • 0.8B • Updated May 1 • 6.06k • 1.16k
microsoft/bitnet-b1.58-2B-4T-gguf

Text Generation • 2B • Updated May 1 • 4.22k • 197
unsloth/Qwen3-4B-Thinking-2507-GGUF

4B • Updated about 1 month ago • 53.1k • 42
nvidia/NVIDIA-Nemotron-Nano-9B-v2

Text Generation • 9B • Updated 7 days ago • 81.3k • 314