Update README.md
README.md
CHANGED
@@ -21,7 +21,7 @@ license: other
 
 These files are GPTQ 4bit model files for [Allen AI's Tulu 30B](https://huggingface.co/allenai/tulu-30b).
 
-It is the result of quantising to 4bit using [
+It is the result of quantising to 4bit using [AutoGPTQ](https://github.com/PanQiWei/AutoGPTQ).
 
 ## Repositories available
 
@@ -29,6 +29,14 @@ It is the result of quantising to 4bit using [GPTQ-for-LLaMa](https://github.com
 * [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/tulu-30B-GGML)
 * [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/TheBloke/tulu-30B-fp16)
 
+## Prompt template
+
+```
+<|user|>
+Your message here!
+<|assistant|>
+```
+
 ## How to easily download and use this model in text-generation-webui
 
 Please make sure you're using the latest version of text-generation-webui
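
The prompt template added in this change can be applied programmatically. The following is a minimal sketch, not code from the model card: `build_prompt`, `USER_TAG`, and `ASSISTANT_TAG` are hypothetical names chosen for illustration, and ending the prompt with a newline after `<|assistant|>` is an assumption based on how the template is laid out in the README.

```python
# Hypothetical helper: wraps one user message in the Tulu prompt template
# shown in the README diff. The names here are illustrative only.
USER_TAG = "<|user|>"
ASSISTANT_TAG = "<|assistant|>"


def build_prompt(message: str) -> str:
    """Return the user message wrapped in the template's role markers."""
    # Assumption: the model's reply starts on the line after <|assistant|>,
    # so the prompt ends with a trailing newline.
    return f"{USER_TAG}\n{message.strip()}\n{ASSISTANT_TAG}\n"


if __name__ == "__main__":
    print(build_prompt("Your message here!"))
```

The returned string can be passed as the prompt to whichever inference front end is in use. Keeping the role markers in constants makes it easy to adjust them if the template changes.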