Hi Harry,
Would it be possible to get some smaller quants of this model to save some VRAM, like for instance Q4_K_S, Q4_K_M, i1-Q4_K_S or i1-Q4_K_M?
Alternatively could you provide a link to the original SafeTensors model somewhere?
· Sign up or log in to comment