SimmonsSongHW/Qwen2.5-3B-Instruct-GGUF
GGUF
conversational
License: apache-2.0
Qwen2.5-3B-Instruct Quantization with Llama.cpp
Downloads last month: 54
GGUF
Model size: 3.09B params
Architecture: qwen2
Bits    Quantization  File size
2-bit   Q2_K          1.27 GB
3-bit   Q3_K          1.59 GB
4-bit   Q4_0          1.82 GB
4-bit   Q4_K          1.93 GB
5-bit   Q5_0          2.17 GB
5-bit   Q5_K          2.22 GB
6-bit   Q6_K          2.54 GB
8-bit   Q8_0          3.29 GB
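The file sizes listed above can be turned into an approximate average bits-per-weight figure, which makes the quantization levels easier to compare. The following is a minimal sketch, not part of the original model card: it assumes the listed sizes use decimal gigabytes (1 GB = 1e9 bytes) and that the whole file consists of the 3.09B parameters, ignoring metadata. The names `QUANT_SIZES_GB`, `N_PARAMS`, and `bits_per_weight` are illustrative only.

```python
# Sizes in GB copied from the quantization list on this page.
QUANT_SIZES_GB = {
    "Q2_K": 1.27,
    "Q3_K": 1.59,
    "Q4_0": 1.82,
    "Q4_K": 1.93,
    "Q5_0": 2.17,
    "Q5_K": 2.22,
    "Q6_K": 2.54,
    "Q8_0": 3.29,
}

# Parameter count from "Model size: 3.09B params".
N_PARAMS = 3.09e9


def bits_per_weight(size_gb, n_params=N_PARAMS):
    """Approximate average bits per parameter.

    Assumption: 1 GB = 1e9 bytes, and file metadata is negligible.
    """
    return size_gb * 1e9 * 8 / n_params


if __name__ == "__main__":
    for name, size_gb in QUANT_SIZES_GB.items():
        print(f"{name}: {bits_per_weight(size_gb):.2f} bits/weight")
```

The estimates typically come out above the nominal bit width in the first column. This is likely because the quantized formats store per-block scale factors alongside the weights and may keep some tensors at higher precision.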
View +1 variant
Model tree for SimmonsSongHW/Qwen2.5-3B-Instruct-GGUF
Base model: Qwen/Qwen2.5-3B
Finetuned: Qwen/Qwen2.5-3B-Instruct
Quantized (127): this model
Collection including SimmonsSongHW/Qwen2.5-3B-Instruct-GGUF
Qwen2.5-Quants (Collection, 4 items, updated 25 days ago)