Edit Models filters

Apps

Docker Model Runner

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

189

Full-text search

Active filters: torchao

Vykyan/FuseO1-DeepSeekR1-Qwen2.5-Coder-32B-Preview-ao-int4wo-gs128

Updated 19 days ago • 11

jcaip/fp8fnuz-opt-125m

Text Generation • Updated 18 days ago • 16

appy1234/Phi-4-mini-instruct-float8dq

Text Generation • Updated 15 days ago • 29

poinka/gemma-3-4b-pt-q8bits

Updated 11 days ago • 15

kexve/DeepSeek-R1-Distill-Qwen-1.5B-torchao-int8_weight_only

Updated 6 days ago • 7

vkuzometa/fp8-opt-125m

Text Generation • Updated 6 days ago • 4

torchao-testing/opt-125m-int4wo-preshuffle

Text Generation • Updated 6 days ago • 155

torchao-testing/opt-125m-float8dq-row-fbgemm

Text Generation • Updated 6 days ago • 189

Bbboris1234/Phi-4-int4wo-hqq

Text Generation • Updated 4 days ago • 10