---
license: apache-2.0
base_model: Jinx-org/Jinx-Qwen3-4B
base_model_relation: quantized
---
## Recommended
**Jinx-Qwen3-4B-gguf-q6_k-q-8** (mixed precision): selected weights (the output layer, token embeddings, and the attention/FFN layers in the first and last blocks) are quantized to Q8_0, while the remaining tensors use Q6_K, reducing the memory footprint while preserving inference fidelity.
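To get a feel for what this mix buys, the sketch below estimates the file size from the bits-per-weight of each quantization type (derived from llama.cpp's block layouts: Q8_0 stores 32 weights in 34 bytes, Q6_K stores 256 weights in 210 bytes). The 10% Q8_0 share is an illustrative assumption, not a measured property of this file.

```python
# Rough size estimate for a mixed Q6_K / Q8_0 GGUF quantization.
# Bits-per-weight from llama.cpp block layouts:
#   Q8_0: blocks of 32 weights  -> 34 bytes  (8.5    bpw)
#   Q6_K: blocks of 256 weights -> 210 bytes (6.5625 bpw)
Q8_0_BPW = 34 * 8 / 32
Q6_K_BPW = 210 * 8 / 256

def estimate_gib(total_params: float, q8_fraction: float) -> float:
    """Estimated file size in GiB, given the fraction of weights kept at Q8_0."""
    bpw = q8_fraction * Q8_0_BPW + (1 - q8_fraction) * Q6_K_BPW
    return total_params * bpw / 8 / 2**30

# ~4e9 parameters; assume roughly 10% of weights (embeddings, output head,
# first/last blocks) sit at Q8_0 and the rest at Q6_K -- an assumption for
# illustration only.
print(f"~{estimate_gib(4e9, 0.10):.2f} GiB")
```

The gap between pure Q6_K (~6.56 bpw) and this mix stays small because the Q8_0 tensors are a minority of the total weight count, which is why the quality-sensitive layers can be upgraded cheaply.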