GGML Version
#1
by
s3nh
- opened
:) https://huggingface.co/s3nh/mamba-gpt-3b-v3-GGML
did not check if there is significant drop of accuracy, but there exist a visible speed up in inference time.
All best,
Damian