TobDeBer
/

SmartQuant

Model card Files Files and versions

Ctrl+K

Ctrl+K

3 contributors

History: 13 commits

TobDeBer's picture

Upload llama-server-6343-cuda with huggingface_hub

b02ea94 verified 10 days ago

.gitattributes

2.07 kB

Upload llama-server-6343-cuda with huggingface_hub 10 days ago
README.md

405 Bytes

Update README.md 5 months ago
SmartQuant-Falcon-H1-0.5B-Instruct.gguf

275 MB
xet

Upload SmartQuant-Falcon-H1-0.5B-Instruct.gguf with huggingface_hub 2 months ago
SmartQuant-Llama-3.3-70B-Instruct.gguf

21 GB
xet

Rename Llama-3.3-70B-Instruct-SmartQuant.gguf to SmartQuant-Llama-3.3-70B-Instruct.gguf 5 months ago
SmartQuant-granite-3.3-8b-instruct.gguf

5.84 GB
xet

Rename granite-3.3-8b-instruct-SmartQuant.gguf to SmartQuant-granite-3.3-8b-instruct.gguf 5 months ago
Tiny-Moe.Q6_K_T3.gguf

84.7 MB
xet

Upload Tiny-Moe.Q6_K_T3.gguf with huggingface_hub 18 days ago
calibration_datav3.txt

280 kB

add quantization tool 5 months ago
llama-quantize

2.78 MB
xet

add quantization tool 5 months ago
llama-server-6343-cuda

321 MB
xet

Upload llama-server-6343-cuda with huggingface_hub 10 days ago