What is the context length of this model?
#2 by YairFr
Is it 4K or 16K?
The base CodeLlama was trained with a 16K context length. It can theoretically be used at longer context lengths as well, and of course shorter if you want.
WizardCoder specifically was, I believe, fine-tuned at 4K. The model will still support 16K, though how well it will cope with the fine-tuning at that length, I don't know.
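To illustrate the general idea of running past the trained length (this is just my own sketch, not anything from the WizardCoder or CodeLlama repos): linear RoPE scaling stretches a model trained at one context length over a longer one by scaling down the position indices.

```python
# Linear RoPE scaling (position interpolation), illustrated.
# Assumption: the model was trained at 16K and we want to run at 32K.
trained_ctx = 16384   # context length the base model was trained with
target_ctx = 32768    # context length we would like to run at

# Positions are effectively multiplied by this factor, so position
# 32768 is presented to the model as position 16384.
rope_freq_scale = trained_ctx / target_ctx
print(f"rope_freq_scale = {rope_freq_scale}")  # 0.5
```

In llama.cpp terms this factor corresponds to the `--rope-freq-scale` option, if I understand it correctly.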
Is there any way to get longer context with GGUF files? I heard that llama.cpp supports longer context, but you need to do some configuration (which I don't know yet).
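From what I've seen, the relevant knobs are the context size and the RoPE scaling parameters. Here is a minimal sketch using the llama-cpp-python bindings; the model filename is a placeholder and the parameter values are assumptions you would adjust for your model:

```python
from llama_cpp import Llama

# Load a GGUF model with an enlarged context window.
# n_ctx sets the context size; rope_freq_scale adjusts RoPE if the
# model wasn't trained at this length (1.0 = no scaling).
llm = Llama(
    model_path="wizardcoder.Q4_K_M.gguf",  # placeholder filename
    n_ctx=16384,           # request a 16K context
    rope_freq_scale=1.0,   # use trained_ctx / target_ctx for linear scaling
)

out = llm(
    "### Instruction: Write hello world in Python.\n### Response:",
    max_tokens=128,
)
print(out["choices"][0]["text"])
```

With the llama.cpp CLI, I believe the equivalents are `-c 16384` (or `--ctx-size`) and `--rope-freq-scale`.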