
What is the context size?

#16 opened by fatshady

What is the context size and is there any way to extend it?

The paper says: "However, we note that the Aya model is finetuned using up to 1024 input tokens as in mT5 pretraining, ..." (Section 5.1.2, page 17).

https://cohere.com/research/aya/aya-model-paper.pdf
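Since Aya is mT5-based and mT5 uses relative position embeddings, longer inputs may still run mechanically, but quality beyond the 1024-token fine-tuning length is untested. A minimal sketch of respecting that limit on the token-ID level (the constant and helper are illustrative, not from the Aya codebase; in practice you would pass `truncation=True, max_length=1024` to the tokenizer):

```python
# Fine-tuning input limit reported in the Aya paper (Sec. 5.1.2).
MAX_INPUT_TOKENS = 1024

def truncate_input(token_ids, limit=MAX_INPUT_TOKENS):
    """Keep at most `limit` tokens so the input matches the trained length."""
    return token_ids[:limit]

# Example: a 3000-token input is cut down to the trained limit.
ids = list(range(3000))
truncated = truncate_input(ids)
print(len(truncated))
```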

alexrs changed discussion status to closed
