Maybe a requant is needed, plus IQ3_S models?

by Cran-May - opened May 1, 2024

May 1, 2024

As the title says.

May 1, 2024

IQ3_S is just the right fit for 4GB VRAM devices running 8B models. (IQ3_M is best for 7B models.)
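For a rough sense of why that works out, here is a back-of-the-envelope sketch of the arithmetic. The bits-per-weight figures (about 3.44 for IQ3_S, about 3.66 for IQ3_M), the round parameter counts, and the 0.5 GiB allowance for KV cache and buffers are all assumptions for illustration, and the helper names are hypothetical. Real GGUF files come out somewhat larger, since some tensors are kept at higher precision.

```python
# Rough estimate of quantized weight size: params * bits-per-weight / 8 bytes.
# All constants here are illustrative assumptions, not measured values.
NOMINAL_BPW = {"IQ3_S": 3.44, "IQ3_M": 3.66}  # assumed nominal bits per weight


def estimate_weights_gib(n_params: float, bpw: float) -> float:
    """Return the approximate size of the quantized weights in GiB."""
    return n_params * bpw / 8 / 2**30


def fits(n_params: float, quant: str, vram_gib: float = 4.0,
         overhead_gib: float = 0.5) -> bool:
    """Check whether weights plus an assumed fixed overhead fit in VRAM.

    overhead_gib is a guessed allowance for KV cache and compute buffers;
    in practice it grows with context length.
    """
    weights = estimate_weights_gib(n_params, NOMINAL_BPW[quant])
    return weights + overhead_gib <= vram_gib


if __name__ == "__main__":
    for label, n_params, quant in [("8B", 8e9, "IQ3_S"), ("7B", 7e9, "IQ3_M")]:
        size = estimate_weights_gib(n_params, NOMINAL_BPW[quant])
        print(f"{label} @ {quant}: ~{size:.2f} GiB weights, "
              f"fits in 4 GiB: {fits(n_params, quant)}")
```

Under these assumptions an 8B model at IQ3_S leaves well under 1 GiB of headroom on a 4GB card, so the context length has to stay modest.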

Owner May 1, 2024

I'd need to try redoing these quants with the latest llamacpp, and if I do, I'll include IQ3_S.

Owner May 2, 2024

These will be reuploaded with the new llamacpp version.

Lewdiculous changed discussion status to closed May 2, 2024
