Brianpuze/Qwen2.5-0.5B-Q4_K_M-GGUF

This repo contains GGUF quantized versions of Qwen/Qwen2.5-0.5B, produced with llama.cpp.

Quantized Versions:

llama-cli --hf-repo Brianpuze/Qwen2.5-0.5B-Q4_K_M-GGUF --hf-file qwen2.5-0.5b-q4_k_m.gguf -p
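The command above ends with `-p`, the flag that takes the prompt text. As a minimal sketch, assuming `llama-cli` from llama.cpp is installed and on `PATH`, the same invocation could be assembled from Python as shown here. The helper names `build_llama_cli_command` and `run_llama_cli` and the prompt string are hypothetical and chosen only for illustration; the repo name, file name, and flags are copied from the command above.

```python
import shlex
import shutil
import subprocess

# Values copied from the llama-cli command in this README.
HF_REPO = "Brianpuze/Qwen2.5-0.5B-Q4_K_M-GGUF"
HF_FILE = "qwen2.5-0.5b-q4_k_m.gguf"


def build_llama_cli_command(prompt: str) -> list[str]:
    """Return the llama-cli argument list for this repo (hypothetical helper)."""
    if not prompt:
        raise ValueError("prompt must be a non-empty string")
    return ["llama-cli", "--hf-repo", HF_REPO, "--hf-file", HF_FILE, "-p", prompt]


def run_llama_cli(prompt: str) -> int:
    """Run llama-cli if it is on PATH and return its exit code (hypothetical helper)."""
    if shutil.which("llama-cli") is None:
        raise FileNotFoundError("llama-cli not found on PATH; install llama.cpp first")
    # Passing a list (no shell) avoids quoting problems in the prompt text.
    return subprocess.run(build_llama_cli_command(prompt), check=False).returncode


if __name__ == "__main__":
    # Placeholder prompt, an assumption made only for this example.
    command = build_llama_cli_command("Hello, my name is")
    # Print a shell-safe version of the command instead of running it.
    print(shlex.join(command))
```

Building the argument list separately from running it keeps the command easy to inspect before anything is downloaded or executed.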

Format: GGUF

Model size: 494M params

Architecture: qwen2

Quantization: 4-bit
