An updated version of the previous model. In this one, I have not yet found any problems with word duplication.

02.05.24 Model updates, new versions are in the v1.1 branch.

Link to original model and script:

Downloads last month
46
GGUF
Model size
8.03B params
Architecture
llama
Hardware compatibility
Log In to view the estimation

4-bit

5-bit

6-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support