ReadyArt/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-AWQ

Text Generation · text-generation-inference · 4-bit precision

FrenzyBiscuit

AWQ Details

The model was quantized to INT4 using GEMM kernels.
Zero-point quantization
Group size of 64
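
To make the settings above concrete, here is a minimal pure-Python sketch of zero-point (asymmetric) INT4 quantization applied per group of 64 weights. It is an illustration only, not the code used to produce this model: the function names (`quantize_group`, `dequantize_group`, `quantize_row`) are hypothetical, and the activation-aware scaling that AWQ applies before this rounding step, as well as the GEMM kernel packing, are not shown.

```python
from typing import List, Tuple

GROUP_SIZE = 64             # group size stated in the model card
N_BITS = 4                  # INT4
Q_MAX = (1 << N_BITS) - 1   # largest unsigned 4-bit code: 15


def quantize_group(weights: List[float]) -> Tuple[List[int], float, int]:
    """Quantize one group to unsigned 4-bit codes with its own scale and zero point."""
    w_min, w_max = min(weights), max(weights)
    # Guard against a constant group, where the range would be zero.
    scale = max(w_max - w_min, 1e-5) / Q_MAX
    # The zero point is the integer code that represents the real value 0.0.
    zero_point = max(0, min(Q_MAX, round(-w_min / scale)))
    codes = [max(0, min(Q_MAX, round(w / scale) + zero_point)) for w in weights]
    return codes, scale, zero_point


def dequantize_group(codes: List[int], scale: float, zero_point: int) -> List[float]:
    """Map 4-bit codes back to approximate real-valued weights."""
    return [(q - zero_point) * scale for q in codes]


def quantize_row(row: List[float]) -> List[Tuple[List[int], float, int]]:
    """Split a weight row into groups of GROUP_SIZE and quantize each independently."""
    if len(row) % GROUP_SIZE != 0:
        raise ValueError("row length must be a multiple of GROUP_SIZE")
    return [
        quantize_group(row[i:i + GROUP_SIZE])
        for i in range(0, len(row), GROUP_SIZE)
    ]
```

Each group stores its own scale and zero point, so a smaller group size tracks local weight ranges more closely at the cost of extra per-group metadata.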

Downloads last month: 11

Safetensors
Model size: 420M params
Tensor type: I32 · BF16 · F16

Model tree for ReadyArt/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-AWQ

Base model: Qwen/Qwen2.5-1.5B
Finetuned: Qwen/Qwen2.5-1.5B-Instruct
Finetuned: BeaverAI/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B
Quantized (9): this model

Collection including ReadyArt/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-AWQ

AWQ Quants: Quants by FrenzyBiscuit & Artus • 22 items • Updated 26 days ago