> A newer version of this model is available: [Qwen/Qwen2.5-0.5B](https://huggingface.co/Qwen/Qwen2.5-0.5B)

# Brianpuze/Qwen2-0.5B-Q4_K_M-Q4_0-GGUF

This repo contains GGUF quantized versions of Qwen/Qwen2-0.5B, produced with llama.cpp.

## Quantized Versions

- Q4_K_M (`qwen2-0.5b-q4_k_m.gguf`)
- Q4_0

## Run with llama.cpp

```bash
llama-cli --hf-repo Brianpuze/Qwen2-0.5B-Q4_K_M-Q4_0-GGUF --hf-file qwen2-0.5b-q4_k_m.gguf -p "The meaning of life is"
```

## Model Details

- Format: GGUF
- Model size: 494M params
- Architecture: qwen2
- Quantization: 4-bit
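As a rough sanity check on the download size, you can estimate a quantized file's size from the parameter count and the average bits per weight of the quantization scheme. The ~4.85 bits/weight figure for Q4_K_M and the formula below are illustrative assumptions, not exact: real GGUF files are somewhat larger because some tensors (e.g. embeddings and the output head) are kept at higher precision and the file carries metadata.

```python
# Back-of-envelope GGUF size estimate (illustrative assumptions, not exact).
# Q4_K_M mixes several quant types; ~4.85 bits/weight is a commonly cited average.

def estimate_gguf_size_mb(n_params: float, bits_per_weight: float) -> float:
    """Approximate quantized model file size in mebibytes."""
    return n_params * bits_per_weight / 8 / 1024**2

# 494M parameters at an assumed ~4.85 bits/weight
size_mb = estimate_gguf_size_mb(494e6, 4.85)
print(f"~{size_mb:.0f} MB")  # rough lower bound; the real file is larger
```

For a 0.5B-class model the underestimate is noticeable, since the embedding tables are a large fraction of the weights and are stored at higher precision.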

## Base Model

This model is a quantized version of [Qwen/Qwen2-0.5B](https://huggingface.co/Qwen/Qwen2-0.5B).