shakedzy commited on
Commit
a026b95
·
verified ·
1 Parent(s): 2cb6d15

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -0
README.md CHANGED
@@ -13,3 +13,12 @@ model-index:
13
  - name: QwQ-32B-Preview-with-Tags-LoRA-GGUF
14
  results: []
15
  ---
 
 
 
 
 
 
 
 
 
 
13
  - name: QwQ-32B-Preview-with-Tags-LoRA-GGUF
14
  results: []
15
  ---
16
+
17
+ # QwQ-32B-Preview LoRA for separating thinking/answer parts
18
+
19
+ This LoRA file was fine-tuned to make QwQ constantly separate its private thoughts from the final answer using `<THINKING>...</THINKING><ANSWER>...</ANSWER>` tags.
20
+
21
+ For best results, it's also recommended to add the following to the System Prompt:
22
+ > Your private thoughts must be placed inside <THINKING>...</THINKING> XML tags, and your final answer to the user must be placed inside <ANSWER>...</ANSWER> XML tags. These tags MUST appear in all your responses.
23
+
24
+ This GGUF file can be used with Ollama as an adapter of the [unsloth/QwQ-32B-Preview-GGUF](https://huggingface.co/unsloth/QwQ-32B-Preview-GGUF/tree/main) quantized models. See the attached `Modelfile` for an example.