This repository contains q4_0, q4_1, q5_0, and q5_1 quantizations of the model https://huggingface.co/CobraMamba/mamba-gpt-3b-v4, provided in both GGML (v3) and GGUF (v2) formats.
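Because both container formats are shipped, it can help to confirm which one a downloaded file actually is before handing it to a loader. The following is a minimal sketch, not part of the original repository: it reads the first 8 bytes of a file and reports the magic and version. It assumes that "GGML (v3)" here means the `ggjt` v3 container, whose magic `0x67676a74` is stored little-endian (bytes `tjgg` on disk), and that GGUF files begin with the bytes `GGUF` followed by a little-endian 32-bit version. The function name `detect_format` and any file path passed to it are hypothetical.

```python
import struct

GGUF_MAGIC = b"GGUF"   # GGUF files start with these four bytes
GGJT_MAGIC = b"tjgg"   # assumed GGML v3 container: 0x67676a74 written little-endian


def detect_format(path):
    """Return (format_name, version) based on the first 8 bytes of the file.

    format_name is "gguf", "ggml-ggjt", or "unknown"; version is an int,
    or None when the format is not recognized.
    """
    with open(path, "rb") as f:
        header = f.read(8)
    if len(header) < 8:
        return ("unknown", None)

    magic = header[:4]
    # The version is a little-endian unsigned 32-bit integer after the magic.
    version = struct.unpack("<I", header[4:8])[0]

    if magic == GGUF_MAGIC:
        return ("gguf", version)
    if magic == GGJT_MAGIC:
        return ("ggml-ggjt", version)
    return ("unknown", None)
```

Calling `detect_format` on a file from this repository would be expected to report version 2 for the GGUF files and version 3 for the GGML files, provided the assumptions above hold.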

Format: GGUF
Model size: 3.43B params
Architecture: llama