--- library_name: transformers pipeline_tag: text-generation tags: - 12b - 4-bit - Q4_K_M - gguf - llama-cpp - nemo - saiga - text-generation --- # roleplaiapp/saiga_nemo_12b_gguf-Q4_K_M-GGUF **Repo:** `roleplaiapp/saiga_nemo_12b_gguf-Q4_K_M-GGUF` **Original Model:** `saiga_nemo_12b_gguf` **Quantized File:** `saiga_nemo_12b.Q4_K_M.gguf` **Quantization:** `GGUF` **Quantization Method:** `Q4_K_M` ## Overview This is a GGUF Q4_K_M quantized version of saiga_nemo_12b_gguf ## Quantization By I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful. Andrew Webby @ [RolePlai](https://roleplai.app/).