Edit Models filters

Apps

Docker Model Runner

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

200

Full-text search

Active filters: llama.cpp

tifin-india/sarvam-m-24b-q5-1-gguf

Text Generation • 24B • Updated May 24 • 16

tifin-india/sarvam-m-24b-q2-k-gguf

Text Generation • 24B • Updated May 24 • 20

tifin-india/sarvam-m-24b-f16-gguf

Text Generation • 24B • Updated May 24 • 25

tifin-india/sarvam-m-24b-q3-k-l-gguf

Text Generation • 24B • Updated May 24 • 17

tifin-india/sarvam-m-24b-q3-k-s-gguf

Text Generation • 24B • Updated May 24 • 42

tifin-india/sarvam-m-24b-q3-k-gguf

Text Generation • 24B • Updated May 24 • 74

tifin-india/sarvam-m-24b-q4-k-m-gguf

Text Generation • 24B • Updated May 24 • 34 • 1

tifin-india/sarvam-m-24b-q3-k-m-gguf

Text Generation • 24B • Updated May 24 • 12

tifin-india/sarvam-m-24b-q4-k-s-gguf

Text Generation • 24B • Updated May 24 • 53

tifin-india/sarvam-m-24b-q5-k-m-gguf

Text Generation • 24B • Updated May 24 • 110 • 2

ykarout/MiMo-VL-7B-SFT-GGUF

Image-Text-to-Text • 8B • Updated Jun 2 • 131

XythicK/Qwen.Qwen2.5-Math-1.5B-GGUF

2B • Updated about 1 month ago • 68

Govind222/Koyna-V2-1b-instruct-GGUF

1.0B • Updated 30 days ago

agentlans/SmolLM2-135M-Instruct-GGUF

0.1B • Updated 29 days ago • 81

ReallyFloppyPenguin/Holo1-3B-GGUF

3B • Updated 25 days ago • 250 • 2

mgonzs13/SpaceOm-GGUF

Image-Text-to-Text • 3B • Updated 24 days ago • 386 • 1

Darkhn/L3.3-70B-Animus-V1-GGUF

71B • Updated 19 days ago • 482

allura-quants/allura-org_Q3-8B-Kintsugi-GGUF

Updated 21 days ago

ReallyFloppyPenguin/sarvam-m-GGUF

24B • Updated 21 days ago • 483 • 1

ReallyFloppyPenguin/DeepSeek-R1-0528-Qwen3-8B-GGUF

8B • Updated 21 days ago • 416

ReallyFloppyPenguin/MiniCPM4-8B-GGUF

8B • Updated 21 days ago • 55

ReallyFloppyPenguin/Nemotron-Research-Reasoning-Qwen-1.5B-GGUF

2B • Updated 21 days ago • 160 • 1

ReallyFloppyPenguin/OpenCodeReasoning-Nemotron-14B-GGUF

15B • Updated 19 days ago • 97 • 1

ReallyFloppyPenguin/Jan-nano-GGUF

4B • Updated 19 days ago • 111

ReallyFloppyPenguin/Qwen2.5-Math-7B-GGUF

Updated 19 days ago

ReallyFloppyPenguin/Qwen3-0.6B-GGUF

0.8B • Updated 19 days ago • 86

ReallyFloppyPenguin/Holo1-7B-GGUF

8B • Updated 19 days ago • 101

ReallyFloppyPenguin/DeepSeek-R1-Distill-Qwen-32B-GGUF

33B • Updated 18 days ago • 45

ReallyFloppyPenguin/Gemma-3-Gaia-PT-BR-4b-it-GGUF

4B • Updated 18 days ago • 101

ReallyFloppyPenguin/Qwen3-30B-A3B-GGUF

31B • Updated 17 days ago • 28