Fix tokenizer model_max_length to match model config (8192)
#11 opened by faridlazuarda
This PR updates `tokenizer_config.json` to reflect the correct context length (8192) supported by the underlying Gemma-2 base model. The original value (2048) causes truncation errors and warnings for sequences longer than 2k tokens, even though the model supports up to 8192 tokens (as stated in the Gemma-2 model documentation and the model's own `config.json`).