Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Nebius AI Studio
Novita
Nscale
Replicate
fal
Fireworks
Cerebras
Together AI
Cohere
Hyperbolic
SambaNova
HF Inference API
Misc
Reset Misc
torchao
Inference Endpoints
text-generation-inference
custom_code
4-bit precision
Merge
Misc with no match
Eval Results
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
138
Full-text search
Edit filters
Sort: Trending
Active filters:
torchao
Clear all
gurro/llama-3.1-8B-torchao-int4wo-128
Text Generation
•
Updated
Dec 2, 2024
•
9
gurro/llama-3.1-8B-torchao-int4wo-256
Text Generation
•
Updated
Dec 2, 2024
•
10
jerryzh168/llama3-8b-autoquant
Text Generation
•
Updated
Feb 19
•
15
medmekk/Llama-3.1-8B-Instruct-torchao-int8_weight_only
Updated
Jan 8
•
1
medmekk/Llama-3.1-8B-Instruct-torchao-int8wo
Updated
Jan 8
•
1
medmekk/Llama-3.1-8B-Instruct-torchao-int8da8w
Updated
Jan 8
medmekk/Llama-3.2-3B-Instruct-torchao-int8wo
Updated
Jan 8
•
3
medmekk/Llama-3.2-1B-torchao-int8wo
Updated
Jan 8
•
5
medmekk/Llama-3.2-1B-torchao-int8da8w
Updated
Jan 8
•
1
medmekk/Llama-3.2-3B-Instruct-torchao-int8da8w
Updated
Jan 8
medmekk/Llama-3.1-70B-Instruct-torchao-int8da8w
Updated
Jan 8
•
2
jerryzh168/Meta-Llama-3-8B-torchao-int8_weight_only
Updated
Jan 13
jerryzh168/Meta-Llama-3-8B-torchao-int4_weight_only-gs_128
Updated
Jan 13
•
1
jerryzh168/Meta-Llama-3-8B-torchao-int4_weight_only-gs_64
Updated
Jan 13
HF-Quantization/Llama-3.2-1B-TORCHAO-W8
Updated
Jan 21
•
1
HF-Quantization/Llama-3.2-1B-TORCHAO-W8A8
Updated
Jan 21
•
1
HF-Quantization/Llama-3.2-1B-TORCHAO-W4
Updated
Jan 21
•
1
HF-Quantization/Llama-3.3-70B-Instruct-TORCHAO-W4
Updated
Jan 22
•
3
jpablomch/Meta-Llama-3-8B-Instruct-torchao
Text Generation
•
Updated
Feb 19
•
7
jerryzh168/llama3-8b-int4wo-128
Text Generation
•
Updated
Feb 21
•
5
jerryzh168/llama3-8b-int8wo
Text Generation
•
Updated
Feb 27
•
6
alpindale/Meta-Llama-3-8B-torchao-int8_weight_only
Updated
Mar 2
drisspg/f8a8-opt-125m
Text Generation
•
Updated
Mar 4
•
5
drisspg/f8a8-opt-125m_2
Text Generation
•
Updated
Mar 5
•
4
drisspg/float8_dynamic_act_float8_weight-opt-125m
Text Generation
•
Updated
Mar 19
•
41
marksaroufim/Meta-Llama-3-8B-torchao-int8_weight_only
Updated
Mar 20
•
3
jerryzh168/llama3-int8wo
Text Generation
•
Updated
Mar 20
•
4
jerryzh168/llama3-int4wo
Text Generation
•
Updated
Mar 21
•
4
jerryzh168/gemma3-8da4w
Image-Text-to-Text
•
Updated
Mar 25
•
2
jerryzh168/gemma3-4b-it-float8dq
Image-Text-to-Text
•
Updated
Mar 26
•
2
Previous
1
2
3
4
5
Next