[Cache Request] meta-llama/Meta-Llama-3-8B

#71
by sandkoan - opened

Please add the following model to the neuron cache

AWS Inferentia and Trainium org

Hello sandkoan,
Thank you for opening a cache request. This model is already supported in the cache; you can see the supported configurations in this JSON file.
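
For reference, one way to inspect that file programmatically is sketched below. The cache repository id and the config file path are assumptions for illustration (they are not confirmed in this thread), so check the actual cache repository for the real names.

```python
import json

from huggingface_hub import hf_hub_download

# Assumption: repo id and file path below are illustrative placeholders;
# verify the real names in the Neuron cache repository.
CACHE_REPO_ID = "aws-neuron/optimum-neuron-cache"
CONFIG_FILE = "inference-cache-config/llama3.json"

# Download the JSON file listing the cached export configurations.
config_path = hf_hub_download(repo_id=CACHE_REPO_ID, filename=CONFIG_FILE)

with open(config_path) as f:
    cached_configs = json.load(f)

# Print the cached configurations (batch size, sequence length, etc.).
print(json.dumps(cached_configs, indent=2))
```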

sandkoan changed discussion status to closed