metadata
library_name: transformers
pipeline_tag: text-generation
tags:
- 12b
- 4-bit
- Q4_K_M
- gguf
- llama-cpp
- nemo
- saiga
- text-generation
roleplaiapp/saiga_nemo_12b_gguf-Q4_K_M-GGUF
Repo: roleplaiapp/saiga_nemo_12b_gguf-Q4_K_M-GGUF
Original Model: saiga_nemo_12b_gguf
Quantized File: saiga_nemo_12b.Q4_K_M.gguf
Quantization: GGUF
Quantization Method: Q4_K_M
Overview
This is a GGUF Q4_K_M quantized version of saiga_nemo_12b_gguf
Quantization By
I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.
Andrew Webby @ RolePlai.