This is Qwen/Qwen3-32B quantized with AutoRound in 4-bit (symmetric + gptq format). The model has been created, tested, and evaluated by The Kaitchup. The model is compatible with vLLM and Transformers.

More details in this article: How Well Does Qwen3 Handle 4-bit and 2-bit Quantization?

Developed by: The Kaitchup
License: Apache 2.0 license

How to Support My Work

Subscribe to The Kaitchup. This helps me a lot to continue quantizing and evaluating models for free.

Downloads last month: 2,151

Safetensors

Model size

5.74B params

Tensor type

I32

BF16

F16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for kaitchup/Qwen3-32B-autoround-4bit-gptq

Base model

Qwen/Qwen3-32B

Quantized

(92)

this model

Collection including kaitchup/Qwen3-32B-autoround-4bit-gptq

Quantized Qwen3

Collection

13 items • Updated 29 days ago • 1