Qwen3-4B-Instruct-2507-NEO-Imatrix-PlayGround-GGUF

16 Neo Imatrix experimental configs (4B and 6B ggufs) for new Qwen 3 4B 2507 Instruct and "Thinking" 2507 too.

All quants work.

This repo is a test bed for each quant/config type, which will be followed by a single repo per winning "configs".

Repo contains a "baseline" also of Instruct and Thinking as a reference - no "NEO" / 20-2 in the filename.

6 Quants at 6B parameters/55 layers/607 tensors (3 Instruct, 3 Thinking) of Brainstorm 20x (denoted by 20-2 in the file name) too.

All quants: 256k context.

Model Sources [read more about model, settings and so...]:

https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507

https://huggingface.co/Qwen/Qwen3-4B-Thinking-2507

Downloads last month
1,885
GGUF
Model size
5.94B params
Architecture
qwen3
Hardware compatibility
Log In to view the estimation

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for DavidAU/Qwen3-4B-Instruct-2507-NEO-Imatrix-PlayGround-GGUF

Quantized
(44)
this model