Update README.md
Browse files
README.md
CHANGED
@@ -10,5 +10,6 @@ datasets:
|
|
10 |
- Trained to reduce infinite repetition, repetition of structure and sentences in multi turn conversation, and repetition within responses.
|
11 |
- Got really annoyed with all of my Mistral Small test models having repetition issues, so I decided to whip this up.
|
12 |
- Produced by doing orpo with Qwen 3 8B at 0 temp + .7 rep pen (<1 increases repetition) as rejected vs V3 03/24 as chosen.
|
|
|
13 |
|
14 |
Enjoy!
|
|
|
10 |
- Trained to reduce infinite repetition, repetition of structure and sentences in multi turn conversation, and repetition within responses.
|
11 |
- Got really annoyed with all of my Mistral Small test models having repetition issues, so I decided to whip this up.
|
12 |
- Produced by doing orpo with Qwen 3 8B at 0 temp + .7 rep pen (<1 increases repetition) as rejected vs V3 03/24 as chosen.
|
13 |
+
- The [LoRA](https://huggingface.co/ConicCat/Mistral-Small-3.2-AntiRep-24B-LoRA) is also available too, if you want to use it to reduce repetition on other MS3.2 tunes.
|
14 |
|
15 |
Enjoy!
|