M3.2-24B-Loki-V1.0-GGUF
GGUF model files for M3.2-24B-Loki-V1.0
.
This repository contains GGUF models quantized using llama.cpp
.
- Base Model: CrucibleLab-TG/M3.2-24B-Loki-V1.0
- Quantization Methods Processed in this Job:
BF16
,Q6_K
,Q5_K_M
,Q5_K_S
,Q5_0
,Q4_K_M
,Q4_K_S
,Q4_0
,Q3_K_L
,Q3_K_M
,Q3_K_S
,Q2_K
,Q8_0
- Importance Matrix Used: No
This specific upload is for the Q8_0
quantization.
- Downloads last month
- 430
Hardware compatibility
Log In
to view the estimation
2-bit
3-bit
4-bit
5-bit
6-bit
8-bit
16-bit
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support