Smaller quants

by RogerS-01 - opened 11 days ago

Hi Harry,

Would it be possible to get some smaller quants of this model to save VRAM, for instance Q4_K_S, Q4_K_M, i1-Q4_K_S, or i1-Q4_K_M?

Alternatively, could you provide a link to the original SafeTensors model?
