Hugging Face
Efficient-ML's Collections
Qwen3-Quantization
Updated 5 days ago
This is the official collection of quantized Qwen3 models from the Qwen3-Quantization project.
Efficient-ML/Qwen3-0.6B-base-gptq-w4-128 (updated 7 days ago)
Efficient-ML/Qwen3-0.6B-base-gptq-w8-128 (updated 7 days ago)
Efficient-ML/Qwen3-0.6B-base-gptq-w8-perchannel (updated 7 days ago)
Efficient-ML/Qwen3-0.6B-base-gptq-w4-perchannel (updated 7 days ago)
Efficient-ML/Qwen3-1.7B-base-gptq-w4-128 (updated 7 days ago)
Efficient-ML/Qwen3-1.7B-base-gptq-w4-perchannel (updated 7 days ago)
Efficient-ML/Qwen3-1.7B-base-gptq-w8-128 (updated 7 days ago)
Efficient-ML/Qwen3-1.7B-base-gptq-w8-perchannel (updated 7 days ago)
Efficient-ML/Qwen3-4B-base-gptq-w4-128 (updated 7 days ago)
Efficient-ML/Qwen3-4B-base-gptq-w8-128 (updated 7 days ago)
Efficient-ML/Qwen3-4B-base-gptq-w8-perchannel (updated 7 days ago)
Efficient-ML/Qwen3-4B-base-gptq-w4-perchannel (updated 7 days ago)
Efficient-ML/Qwen3-8B-base-gptq-w4-128 (updated 7 days ago)
Efficient-ML/Qwen3-8B-base-gptq-w8-128 (updated 6 days ago)
Efficient-ML/Qwen3-8B-base-gptq-w4-perchannel (updated 7 days ago)
Efficient-ML/Qwen3-8B-base-gptq-w8-perchannel (updated 6 days ago)
Efficient-ML/Qwen3-14B-base-gptq-w4-128 (updated 5 days ago)
Efficient-ML/Qwen3-14B-base-gptq-w4-perchannel (updated 5 days ago)
Efficient-ML/Qwen3-14B-base-gptq-w8-128 (updated 5 days ago)
Efficient-ML/Qwen3-14B-base-gptq-w8-perchannel (updated 5 days ago)
Efficient-ML/Qwen3-0.6B-gptq-w8-128 (updated 5 days ago)
Efficient-ML/Qwen3-0.6B-gptq-w4-128 (updated 5 days ago)
Efficient-ML/Qwen3-0.6B-gptq-w4-perchannel (updated 5 days ago)
Efficient-ML/Qwen3-0.6B-gptq-w8-perchannel (updated 5 days ago)
Efficient-ML/Qwen3-1.7B-gptq-w4-128 (updated 5 days ago)
Efficient-ML/Qwen3-1.7B-gptq-w4-perchannel (updated 5 days ago)
Efficient-ML/Qwen3-1.7B-gptq-w8-128 (updated 5 days ago)
Efficient-ML/Qwen3-1.7B-gptq-w8-perchannel (updated 5 days ago)
Efficient-ML/Qwen3-4B-gptq-w4-128 (updated 5 days ago)
Efficient-ML/Qwen3-4B-gptq-w4-perchannel (updated 5 days ago)
Efficient-ML/Qwen3-4B-gptq-w8-128 (updated 5 days ago)
Efficient-ML/Qwen3-4B-gptq-w8-perchannel (updated 5 days ago)
Efficient-ML/Qwen3-8B-gptq-w4-128 (updated 5 days ago)
Efficient-ML/Qwen3-8B-gptq-w4-perchannel (updated 5 days ago)
Efficient-ML/Qwen3-8B-gptq-w8-128 (updated 5 days ago)
Efficient-ML/Qwen3-8B-gptq-w8-perchannel (updated 5 days ago)
Efficient-ML/Qwen3-14B-gptq-w4-perchannel (updated 5 days ago)
Efficient-ML/Qwen3-14B-gptq-w4-128 (updated 5 days ago)
Efficient-ML/Qwen3-14B-gptq-w8-128 (updated 5 days ago)
Efficient-ML/Qwen3-14B-gptq-w8-perchannel (updated 5 days ago)
An Empirical Study of Qwen3 Quantization (Paper 2505.02214, published 7 days ago)
Efficient-ML/Qwen3-awq (updated 5 days ago)
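
The repository names in this collection follow a regular pattern, so they can be split into their parts programmatically. The sketch below is illustrative only: the helper names `QuantRepo` and `parse_repo` are hypothetical, and the reading of the fields (`w4`/`w8` as weight bit width, `128` as a quantization group size, `perchannel` as per-channel quantization, `-base` as the pretrained rather than post-trained checkpoint) is an assumption based on common GPTQ naming, not something the page states.

```python
import re
from typing import NamedTuple, Optional


class QuantRepo(NamedTuple):
    size: str         # model size as written in the name, e.g. "0.6B"
    base: bool        # True when the name carries the "-base" marker
    method: str       # quantization method token, e.g. "gptq"
    weight_bits: int  # assumed meaning of "w4" / "w8"
    group: str        # "128" (assumed group size) or "perchannel"


# Pattern inferred from the names listed in this collection.
_PATTERN = re.compile(
    r"^Efficient-ML/Qwen3-(?P<size>[\d.]+B)(?P<base>-base)?"
    r"-(?P<method>[a-z]+)-w(?P<bits>\d+)-(?P<group>\d+|perchannel)$"
)


def parse_repo(repo_id: str) -> Optional[QuantRepo]:
    """Split a repository id into its fields, or return None if it does not fit."""
    m = _PATTERN.match(repo_id)
    if m is None:
        return None
    return QuantRepo(
        size=m["size"],
        base=m["base"] is not None,
        method=m["method"],
        weight_bits=int(m["bits"]),
        group=m["group"],
    )
```

Names that do not follow the pattern, such as `Efficient-ML/Qwen3-awq`, yield `None` rather than a guess.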