original model gone / output bad for 16k context?

by mclassHF2023 - opened May 9, 2024

mclassHF2023

May 9, 2024

•

edited May 9, 2024

It seems the original unquantized model is gone?
Also, with all wizardLM-2 7B models, I notice that at (maybe before) 16k context, the output is just garbage, things like:
1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1
or even simply newlines, no real output at all.

mclassHF2023 changed discussion title from original model gone? to original model gone / output bad for 16k context? May 9, 2024

mradermacher

Owner May 9, 2024

It seems it is gone. As is the original wizardlm-2.

At least with smaller context it works fine though so it's not broken.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment