Update README.md
Browse files
README.md
CHANGED
|
@@ -29,6 +29,13 @@ It is the result of quantising to 4bit using [GPTQ-for-LLaMa](https://github.com
|
|
| 29 |
* [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/BigTrans-13B-GGML)
|
| 30 |
* [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/James-WYang/BigTrans)
|
| 31 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 32 |
## How to easily download and use this model in text-generation-webui
|
| 33 |
|
| 34 |
Please make sure you're using the latest version of text-generation-webui
|
|
@@ -73,9 +80,9 @@ model = AutoGPTQForCausalLM.from_quantized(model_name_or_path,
|
|
| 73 |
quantize_config=None)
|
| 74 |
|
| 75 |
# Note: check the prompt template is correct for this model.
|
| 76 |
-
prompt = "
|
| 77 |
-
prompt_template=f'''###
|
| 78 |
-
###
|
| 79 |
|
| 80 |
print("\n\n*** Generate:")
|
| 81 |
|
|
|
|
| 29 |
* [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/BigTrans-13B-GGML)
|
| 30 |
* [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/James-WYang/BigTrans)
|
| 31 |
|
| 32 |
+
## Prompt format: Alpaca
|
| 33 |
+
|
| 34 |
+
```
|
| 35 |
+
### Instruction: please translate the following into French: Large language models are the future
|
| 36 |
+
### Response:
|
| 37 |
+
```
|
| 38 |
+
|
| 39 |
## How to easily download and use this model in text-generation-webui
|
| 40 |
|
| 41 |
Please make sure you're using the latest version of text-generation-webui
|
|
|
|
| 80 |
quantize_config=None)
|
| 81 |
|
| 82 |
# Note: check the prompt template is correct for this model.
|
| 83 |
+
prompt = "Please translate the following into French: AI is the future of everything"
|
| 84 |
+
prompt_template=f'''### Instruction: {prompt}
|
| 85 |
+
### Response:'''
|
| 86 |
|
| 87 |
print("\n\n*** Generate:")
|
| 88 |
|