Hugging Face
Models: 74
Active filters: Quantized
VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v16-k65536-256-woft • Updated Feb 25 • 5 • 1
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v8-k65536-256-woft • Updated Feb 25 • 10
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v16-k65536-16384-woft • Updated Feb 25 • 7
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v8-k65536-0-woft • Updated Feb 25 • 5
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v16-k65536-65536-woft • Updated Feb 25 • 15
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v8-k65536-65536-woft • Updated Feb 25 • 7 • 1
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v16-k65536-1024-woft • Updated Feb 25 • 12 • 1
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v12-k65536-4096-woft-vllm • Updated Jan 13 • 1
VPTQ-community/deepseek-r1_v_8_k_65536_256_mp4 • Updated Mar 12 • 8
VPTQ-community/deepseek-r1_v_8_k_65536_mixed_mp4 • Updated Mar 12 • 10 • 2
VPTQ-community/deepseek-r1_v8_k_65536_mp4 • Updated Mar 12 • 19
VPTQ-community/deepseek-r1_v_8_k_65536 • Updated Mar 12 • 7
VPTQ-community/deepseek-r1_v_8_k_65536_256 • Updated Mar 12 • 20
swayamsingal/NanoQuant • Text Generation • Updated Apr 14 • 1