Real model context length?
#3
by
gregporter585
- opened
In your model card, you advertise a context length of 131,072 tokens. However, the model is limited to 32,768, as defined by `max_position_embeddings` in config.json.
I noticed that `rope_scaling` is null. Is RoPE scaling supported so that the full 131,072 tokens can actually be used? If so, what factor do you suggest?
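For reference, I'm guessing something like a YaRN-style entry in config.json is what's needed. This is only a sketch based on how other models handle it; the `factor` of 4.0 is just 131,072 / 32,768, and the exact keys/type may differ for this model:

```json
{
  "rope_scaling": {
    "type": "yarn",
    "factor": 4.0,
    "original_max_position_embeddings": 32768
  }
}
```

Is this the intended configuration, or is a different scaling type recommended?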