Tags: Text Generation · Transformers · Safetensors · English · llama · text-generation-inference · 4-bit precision · gptq
TheBloke committed
Commit 3d98f4f
1 Parent(s): 3dd4495

Update README.md

Files changed (1): README.md (+9 -1)
README.md CHANGED
@@ -21,7 +21,7 @@ license: other
 
 These files are GPTQ 4bit model files for [Allen AI's Tulu 30B](https://huggingface.co/allenai/tulu-30b).
 
-It is the result of quantising to 4bit using [GPTQ-for-LLaMa](https://github.com/qwopqwop200/GPTQ-for-LLaMa).
+It is the result of quantising to 4bit using [AutoGPTQ](https://github.com/PanQiWei/AutoGPTQ).
 
 ## Repositories available
 
@@ -29,6 +29,14 @@ It is the result of quantising to 4bit using [GPTQ-for-LLaMa](https://github.com
 * [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/tulu-30B-GGML)
 * [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/TheBloke/tulu-30B-fp16)
 
+## Prompt template
+
+```
+<|user|>
+Your message here!
+<|assistant|>
+```
+
 ## How to easily download and use this model in text-generation-webui
 
 Please make sure you're using the latest version of text-generation-webui
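
Since the commit changes the attribution to AutoGPTQ, the quantised files can also be loaded directly from Python with AutoGPTQ rather than only through text-generation-webui. Below is a minimal sketch; the repo id `TheBloke/tulu-30B-GPTQ` and the `use_safetensors=True` flag are assumptions inferred from the repo's tags (Safetensors, gptq), not something stated in the diff itself.

```python
# Minimal sketch: load the GPTQ-quantised Tulu 30B with AutoGPTQ.
# Assumes: repo id "TheBloke/tulu-30B-GPTQ", safetensors weights,
# and a CUDA GPU with enough VRAM for a 30B 4-bit model.
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM

model_name = "TheBloke/tulu-30B-GPTQ"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(model_name, use_fast=True)
model = AutoGPTQForCausalLM.from_quantized(
    model_name,
    use_safetensors=True,  # repo is tagged Safetensors
    device="cuda:0",
)
```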
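
The prompt template added by this commit can be applied with plain string formatting before generation. A sketch follows, reusing the `model` and `tokenizer` from the loading example above; the exact newline handling after `<|assistant|>` is an assumption about the template's whitespace.

```python
# Wrap a user message in the README's prompt template.
# The trailing newline after <|assistant|> is an assumption.
def make_prompt(user_message: str) -> str:
    return f"<|user|>\n{user_message}\n<|assistant|>\n"

prompt = make_prompt("Your message here!")

# Generate with the model/tokenizer loaded in the previous sketch.
input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to("cuda:0")
output_ids = model.generate(
    input_ids=input_ids,
    max_new_tokens=256,
    do_sample=True,
    temperature=0.7,
)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```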