SimmonsSongHW/Llama-3.1-8B-Instruct-GGUF
Tags: GGUF, conversational
License: apache-2.0
Llama-3.1-8B-Instruct Quantization with Llama.cpp
Downloads last month: 45
Format: GGUF
Model size: 8.03B params
Architecture: llama
Bits    Quantization  File size
2-bit   Q2_K          3.18 GB
3-bit   Q3_K          4.02 GB
4-bit   Q4_0          4.66 GB
4-bit   Q4_K          4.92 GB
5-bit   Q5_0          5.6 GB
5-bit   Q5_K          5.73 GB
6-bit   Q6_K          6.6 GB
8-bit   Q8_0          8.54 GB
One additional variant is not shown in this list.
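The file sizes above can be related to the stated parameter count by a rough bits-per-weight estimate. The following is a minimal sketch, not part of the original model card. It assumes that "GB" means 10^9 bytes and that the whole file consists of weights (metadata and any tensors kept at higher precision are ignored), so the results are approximate. The names `PARAMS`, `SIZES_GB`, and `bits_per_weight` are illustrative only.

```python
# Rough bits-per-weight estimate from the sizes listed on this page.
# Assumptions (not stated by the source): 1 GB = 10**9 bytes, and the
# entire file is weight data (metadata overhead is ignored).

PARAMS = 8.03e9  # "8.03B params" as listed on the page

# File sizes in GB, copied from the table above.
SIZES_GB = {
    "Q2_K": 3.18,
    "Q3_K": 4.02,
    "Q4_0": 4.66,
    "Q4_K": 4.92,
    "Q5_0": 5.6,
    "Q5_K": 5.73,
    "Q6_K": 6.6,
    "Q8_0": 8.54,
}


def bits_per_weight(size_gb: float, params: float = PARAMS) -> float:
    """Average number of bits stored per parameter for a file of size_gb."""
    return size_gb * 1e9 * 8 / params


if __name__ == "__main__":
    for name, size in SIZES_GB.items():
        print(f"{name}: {bits_per_weight(size):.2f} bits/weight")
```

The estimate tends to come out somewhat above the nominal bit width in the first column, which is expected because quantized blocks also store scale factors and some tensors may be kept at higher precision.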
Model tree for SimmonsSongHW/Llama-3.1-8B-Instruct-GGUF
Base model: meta-llama/Llama-3.1-8B
Finetuned: meta-llama/Llama-3.1-8B-Instruct
Quantized (388): this model
Collection including SimmonsSongHW/Llama-3.1-8B-Instruct-GGUF
Llama3-Quants (collection, 3 items, updated 26 days ago)