Smaller quants

#1
by RogerS-01 - opened

Hi Harry,

Would it be possible to get some smaller quants of this model to save VRAM, for instance Q4_K_S, Q4_K_M, i1-Q4_K_S or i1-Q4_K_M?

Alternatively, could you provide a link to the original SafeTensors model somewhere?
