Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Hyperbolic
Nebius AI Studio
Cerebras
Cohere
Replicate
Fireworks
SambaNova
Novita
Nscale
fal
Together AI
HF Inference API
Misc
Reset Misc
torchao
Inference Endpoints
text-generation-inference
custom_code
4-bit precision
Merge
Misc with no match
Eval Results
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
138
Full-text search
Edit filters
Sort: Trending
Active filters:
torchao
Clear all
GooKSL/ccv_int8wo
Updated
15 days ago
•
38
andysalerno/Qwen3-8B-ao-autoquant
Text Generation
•
Updated
15 days ago
•
4
andrewor14/Llama-3.1-8B-Instruct-float8dq
Text Generation
•
Updated
15 days ago
•
8
HexLang/GPT2
Updated
10 days ago
•
5
CJHauser/vibrance
Text2Text Generation
•
Updated
10 days ago
•
22
metascroy/Qwen3-4B-untied-8da4w-vllm-test
Text Generation
•
Updated
9 days ago
•
20
GingerBled/DPO-Quantized_8bit_mock1
Text Generation
•
Updated
3 days ago
•
38
bziemba/qwen3-0.6B-torchao-int820250521_190819
Text Generation
•
Updated
about 19 hours ago
•
41
GingerBled/DPO-Quantized_8bit_mock2
Text Generation
•
Updated
3 days ago
•
3
sajal09/MNLP_M2_quantized_model2
Text Generation
•
Updated
2 days ago
•
65
Erland/softpick-1.8B-4096-model-AO-W4A4
Text Generation
•
Updated
2 days ago
•
5
Erland/softpick-1.8B-4096-model-AO-W4
Text Generation
•
Updated
2 days ago
•
7
Erland/vanilla-1.8B-4096-model-AO-W4A4
Text Generation
•
Updated
2 days ago
•
5
Erland/vanilla-1.8B-4096-model-AO-W4
Text Generation
•
Updated
2 days ago
•
7
Cloudmaster/Llama-3.2-3B-torchao
Text Generation
•
Updated
1 day ago
•
33
Jiqing/cuda_torchao_llama_68m
Updated
1 day ago
•
2
Cloudmaster/Llama-3.2-3B-torchao-int4
Text Generation
•
Updated
1 day ago
•
19
Cloudmaster/Llama-3.2-3B-torchao-int4-t4
Text Generation
•
Updated
about 23 hours ago
•
3
Previous
1
...
3
4
5
Next