https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-405B

#227
by leafspark - opened

NousResearch/Hermes-3-Llama-3.1-405B

Another huge model by NousResearch, interestingly it's a full parameter fine tune on the base.

They also released a 8B and 70B finetune:
https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-70B
https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B

What a high-quality model. This is a full parameter finetune of Llama-3.1 405B on par with Llama-3.1 405B Instruct. It uses the ChatML template, supports function calling and has a system prompt to generate structured JSON output.

Queued, but for the 405B, patience is required.

mradermacher changed discussion status to closed
Your need to confirm your account before you can post a new comment.

Sign up or log in to comment