Edit Models filters

Apps

Docker Model Runner

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

2,466

Full-text search

Active filters: quantized

yukihamada/buzzquan-sensei-trained

4B • Updated Jun 18 • 17

ReallyFloppyPenguin/Qwen3-30B-A3B-GGUF

31B • Updated Jun 18 • 12

ReallyFloppyPenguin/II-Medical-8B-1706-GGUF

8B • Updated Jun 20 • 34

vlad-m-dev/distiluse-base-multilingual-v2-merged-onnx

Feature Extraction • Updated Jun 22 • 1

mehta/CooperLM-354M-4bit

Text Generation • 0.2B • Updated Jun 21 • 3 • 1

steampunque/Mistral-Small-3.2-24B-Instruct-2506-Hybrid-GGUF

0.4B • Updated Jun 21 • 20

ReallyFloppyPenguin/Polaris-4B-Preview-GGUF

4B • Updated Jun 23 • 65

ReallyFloppyPenguin/Arch-Agent-7B-GGUF

8B • Updated Jun 23 • 70

ReallyFloppyPenguin/Nanonets-OCR-s-GGUF

3B • Updated Jun 23 • 90

kanrishaurus/llama3-8b-sahabatai-v1-instruct-GGUF

Text Generation • 8B • Updated Jun 23 • 21

steampunque/Qwen2.5-VL-7B-Instruct-Hybrid-GGUF

0.7B • Updated Jun 24 • 7

TheMelonGod/Jan-nano-exl2

Text Generation • Updated Jun 30 • 13

NVFP4/DeepSeek-Prover-V2-7B-FP4

4B • Updated 21 days ago • 8

NVFP4/DeepSeek-R1-0528-Qwen3-8B-FP4

5B • Updated 21 days ago • 39

NVFP4/Polaris-4B-Preview-FP4

2B • Updated 21 days ago • 18

NVFP4/Polaris-7B-Preview-FP4

5B • Updated 21 days ago • 8

hdtrnk/Wan2.1_Phantom_FusioniX

Image-to-Video • Updated Jun 25 • 2

PinkPixel/Crystal-Think-V2-GGUF

Text Generation • 4B • Updated Jun 26 • 15 • 1

PinkPixel/Crystal-Think-V2-Imatrix-GGUF

Text Generation • 4B • Updated Jun 26 • 19 • 1

muranAI/DeepSeek-R1-0528-Qwen3-8B-GGUF

8B • Updated Jun 28 • 301 • 1

hrsvrn/Flux.1-kontext-dev-gguf

12B • Updated Jun 26 • 8

onnx-community/distiluse-base-multilingual-v2-merged-onnx

Feature Extraction • Updated Jun 26 • 1

muranAI/gemma-3n-E4B-it-GGUF

Text Generation • 7B • Updated Jun 28 • 910 • 2

agentlans/gemma-3-4b-it-GGUF

4B • Updated Jun 27 • 16

lym00/Wan2.1_T2V_1.3B_SelfForcing_VACE-GGUF

Image-to-Video • 2B • Updated Jun 28 • 449 • 2

muranAI/Mistral-Small-3.1-24B-Instruct-2503-GGUF

Text Generation • 24B • Updated Jun 28 • 179 • 1

vnyaryan/playwright4model_q4_k_m

Text Generation • 3B • Updated Jun 28 • 9

vnyaryan/playwright5model_q4_k_m

Text Generation • 3B • Updated Jun 28 • 13

agentlans/Qwen3-4B-multilingual-sft

4B • Updated Jun 29 • 4

agentlans/Qwen3-4B-multilingual-sft-GGUF

Text Generation • 4B • Updated Jun 29 • 58