Update README.md
Browse files
README.md
CHANGED
@@ -13,3 +13,12 @@ model-index:
|
|
13 |
- name: QwQ-32B-Preview-with-Tags-LoRA-GGUF
|
14 |
results: []
|
15 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
13 |
- name: QwQ-32B-Preview-with-Tags-LoRA-GGUF
|
14 |
results: []
|
15 |
---
|
16 |
+
|
17 |
+
# QwQ-32B-Preview LoRA for separating thinking/answer parts
|
18 |
+
|
19 |
+
This LoRA file was fine-tuned to make QwQ constantly separate its private thoughts from the final answer using `<THINKING>...</THINKING><ANSWER>...</ANSWER>` tags.
|
20 |
+
|
21 |
+
For best results, it's also recommended to add the following to the System Prompt:
|
22 |
+
> Your private thoughts must be placed inside <THINKING>...</THINKING> XML tags, and your final answer to the user must be placed inside <ANSWER>...</ANSWER> XML tags. These tags MUST appear in all your responses.
|
23 |
+
|
24 |
+
This GGUF file can be used with Ollama as an adapter of the [unsloth/QwQ-32B-Preview-GGUF](https://huggingface.co/unsloth/QwQ-32B-Preview-GGUF/tree/main) quantized models. See the attached `Modelfile` for an example.
|