Possible to do IQ2_M and IQ3_S/IQ3_M?

by Panchovix - opened 3 days ago

Discussion

Panchovix

3 days ago

Hi there, many thanks for your work!

It is possible to do these quants?

For example on Qwen 3 30B-A3B

There is a possiblity that IQ2_M would perform equally as Q2_K_XL for same or less size, and IQ3_S/IQ3_M compared to Q3_K_M or Q3_K_XL.

shimmyshimmer

Unsloth AI org 3 days ago

Benchmarks are always subjective and inconclusive unless done in super controlled environments. It's unlikely that a smaller model would perform much better than larger quants and might just be noise.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment