SolidSnacke
/

Llama-3-Soliloquy-8B-v1.5-64k-i-GGUF

Text Generation

text-generation-inference

Model card Files Files and versions Community

An updated version of the previous model. In this one, I have not yet found any problems with word duplication.

02.05.24 Model updates, new versions are in the v1.1 branch.

Link to original model and script:

openlynn/Llama-3-Soliloquy-8B-v1.5-64k: https://huggingface.co/openlynn/Llama-3-Soliloquy-8B-v1.5-64k
FantasiaFoundry/GGUF-Quantization-Script: https://huggingface.co/FantasiaFoundry/GGUF-Quantization-Script

Downloads last month: 9

GGUF

Model size

8.03B params

Architecture

llama

Hardware compatibility

Log In to view the estimation

4-bit

5-bit

6-bit

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support