I'm a little late... I guess.

Link to original model and script:

Downloads last month
81
GGUF
Model size
12.2B params
Architecture
llama
Hardware compatibility
Log In to view the estimation

3-bit

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support