Fix the bug in generation config

#11

by SivilTaram - opened Apr 9

base: refs/heads/main

←

from: refs/pr/11

Discussion Files changed

-1

Fix the bug in generation configdfb80945

SivilTaram

Apr 9

If we follow the default generation configuration, it does not utilize the key-value cache during inference. This can cause the model to be too slow to generate text efficiently.

RaymondAISG

AI Singapore org Apr 11

Thank you very much for the fix.

RaymondAISG changed pull request status to merged Apr 11

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment