Quantized version of: Tesslate/UIGEN-T3-32B-Preview


FYI: standard quants might not perform well with this model; see https://www.reddit.com/r/LocalLLaMA/comments/1l808xc/comment/mx0yea2/

We found that standard quantization significantly degrades quality and can break the model's reasoning chains. For the best results, we highly recommend running it in BF16 or FP8.
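If you follow that recommendation and run the original (unquantized) model, a minimal sketch with the transformers library might look like the following. This assumes transformers and accelerate are installed and that you have hardware with roughly 65 GB of memory for a 32B model in BF16; the prompt is purely illustrative:

```python
# A minimal sketch of running the original model in BF16 with transformers.
# Assumes transformers + accelerate are installed and enough GPU/CPU memory
# for a 32B model (~65 GB in BF16).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Tesslate/UIGEN-T3-32B-Preview"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # BF16, as recommended above
    device_map="auto",           # shard across available devices
)

# Example prompt (illustrative only).
messages = [{"role": "user", "content": "Create a responsive pricing card in HTML and CSS."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(input_ids, max_new_tokens=1024)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```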


'Make knowledge free for everyone'

Buy Me a Coffee at ko-fi.com

Format: GGUF
Model size: 32.8B params
Architecture: qwen3

Available quantizations: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit, 16-bit
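If you do use one of these GGUF quants despite the warning above, the highest-bit quant you can fit (8-bit or 16-bit) is the safest choice. A minimal sketch with the llama-cpp-python package follows; the filename below is an assumed pattern, not an exact file name, so check the repo's file list for the quant you want:

```python
# A minimal sketch of running one of these GGUF quants with llama-cpp-python.
# The filename pattern is a guess -- check the repo's file list for the exact
# name of the quant you want.
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="DevQuasar/Tesslate.UIGEN-T3-32B-Preview-GGUF",
    filename="*Q8_0.gguf",  # prefer a high-bit quant, per the warning above
    n_ctx=8192,             # context window; raise if you have the memory
    n_gpu_layers=-1,        # offload all layers to GPU when one is available
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Build a landing page hero section."}],
    max_tokens=512,
)
print(out["choices"][0]["message"]["content"])
```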


Model tree for DevQuasar/Tesslate.UIGEN-T3-32B-Preview-GGUF
Base model: Qwen/Qwen3-32B
Quantized (3): this model