This repo currently contains the version of generation_config.json from Llama 3 8B Instruct that declares both 128001 and 128009 to be eos tokens. This file can be used to "repair" both full weight models and exl2 quants thereto. Just drop a copy of the file in the same directory as the safetensors files.
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API:
The model has no library tag.