QuantFlex Banner

GGUF Quants for: huihui-ai/SmallThinker-3B-Preview-abliterated

Model by: huihui-ai (thank you!)

Quants by: quantflex

Run with llama.cpp:

./llama-cli -m SmallThinker-3B-Preview-abliterated-Q5_K_M.gguf -p 'You are a helpful assistant.' --temp 0.7 --top-p 0.8 --top-k 20 --repeat-penalty 1.1 -cnv --chat-template chatml

Downloads last month
239
GGUF
Model size
3.09B params
Architecture
qwen2

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

32-bit

Inference Examples
Unable to determine this model's library. Check the docs .

Model tree for quantflex/SmallThinker-3B-Preview-abliterated-GGUF