Update README.md
README.md CHANGED
@@ -22,9 +22,11 @@ This is version 1. It has been fine-tuned using a subset of the data from Pygmal
 This models has the XOR files pre-applied out of the box.
 Converted from the XORs weights from PygmalionAI's release https://huggingface.co/PygmalionAI/pygmalion-7b
 
-
+Quantization was done using https://github.com/0cc4m/GPTQ-for-LLaMa for use in KoboldAI
+
+Via the following command:
 ```
-python llama.py
+python llama.py ./TehVenom_Pygmalion-7b-Merged-Safetensors c4 --wbits 4 --true-sequential --groupsize 32 --save_safetensors Pygmalion-7B-GPTQ-4bit-32g.no-act-order.safetensors
 ```
 
 ## Prompting
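The command added above writes a 4-bit, group-size-32 GPTQ checkpoint (`Pygmalion-7B-GPTQ-4bit-32g.no-act-order.safetensors`). The README targets KoboldAI, but as a minimal sketch of loading such a checkpoint from Python, assuming the AutoGPTQ package (not mentioned in the commit) and a local directory that also holds the tokenizer files:

```
# Sketch only: load the 4-bit GPTQ checkpoint produced by the command above.
# AutoGPTQ is an assumption here (the README itself only mentions KoboldAI);
# adjust the directory and basename to wherever the .safetensors file lives.
from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig
from transformers import AutoTokenizer

model_dir = "./Pygmalion-7B-GPTQ-4bit-32g"  # hypothetical local path with tokenizer + weights

# Mirror the quantization flags: 4-bit, group size 32, no act-order (desc_act=False).
quantize_config = BaseQuantizeConfig(bits=4, group_size=32, desc_act=False)

tokenizer = AutoTokenizer.from_pretrained(model_dir)
model = AutoGPTQForCausalLM.from_quantized(
    model_dir,
    model_basename="Pygmalion-7B-GPTQ-4bit-32g.no-act-order",  # filename minus .safetensors
    quantize_config=quantize_config,
    use_safetensors=True,
    device="cuda:0",
)

inputs = tokenizer("Hello,", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=32)[0]))
```

The `group_size=32` and `desc_act=False` values simply mirror `--groupsize 32` and the `no-act-order` naming from the quantization command; they are not separate claims about the model.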