Models (138)

Active filters: torchao

GooKSL/ccv_int8wo

Updated 15 days ago • 38

andysalerno/Qwen3-8B-ao-autoquant

Text Generation • Updated 15 days ago • 4

andrewor14/Llama-3.1-8B-Instruct-float8dq

Text Generation • Updated 15 days ago • 8

HexLang/GPT2

Updated 10 days ago • 5

CJHauser/vibrance

Text2Text Generation • Updated 10 days ago • 22

metascroy/Qwen3-4B-untied-8da4w-vllm-test

Text Generation • Updated 9 days ago • 20

GingerBled/DPO-Quantized_8bit_mock1

Text Generation • Updated 3 days ago • 38

bziemba/qwen3-0.6B-torchao-int820250521_190819

Text Generation • Updated about 19 hours ago • 41

GingerBled/DPO-Quantized_8bit_mock2

Text Generation • Updated 3 days ago • 3

sajal09/MNLP_M2_quantized_model2

Text Generation • Updated 2 days ago • 65

Erland/softpick-1.8B-4096-model-AO-W4A4

Text Generation • Updated 2 days ago • 5

Erland/softpick-1.8B-4096-model-AO-W4

Text Generation • Updated 2 days ago • 7

Erland/vanilla-1.8B-4096-model-AO-W4A4

Text Generation • Updated 2 days ago • 5

Erland/vanilla-1.8B-4096-model-AO-W4

Text Generation • Updated 2 days ago • 7

Cloudmaster/Llama-3.2-3B-torchao

Text Generation • Updated 1 day ago • 33

Jiqing/cuda_torchao_llama_68m

Updated 1 day ago • 2

Cloudmaster/Llama-3.2-3B-torchao-int4

Text Generation • Updated 1 day ago • 19

Cloudmaster/Llama-3.2-3B-torchao-int4-t4

Text Generation • Updated about 23 hours ago • 3