Files changed (1) hide show
  1. README.md +80 -68
README.md CHANGED
@@ -1,69 +1,81 @@
1
- ---
2
- license: apache-2.0
3
- base_model:
4
- - Qwen/Qwen2.5-32B-Instruct
5
- tags:
6
- - roleplay
7
- - conversational
8
- language:
9
- - en
10
- ---
11
- # Qwen 2.5 32b RP Ink
12
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/634262af8d8089ebaefd410e/1_Zt_OvEW183lmrgidQw8.png)
13
-
14
- A roleplay-focused LoRA finetune of Qwen 2.5 32b Instruct. Methodology and hyperparams inspired by [SorcererLM](https://huggingface.co/rAIfle/SorcererLM-8x22b-bf16) and [Slush](https://huggingface.co/crestf411/Q2.5-32B-Slush).
15
- Yet another model in the Ink series, following in the footsteps of [the Nemo one](https://huggingface.co/allura-org/MN-12b-RP-Ink)
16
-
17
- ## Testimonials
18
- > whatever I tested was crack [...] It's got some refreshingly good prose, that's for sure
19
-
20
- \- TheLonelyDevil
21
-
22
- > The NTR is fantastic with this tune, lots of good gooning to be had. [...] Description and scene setting prose flows smoothly in comparison to larger models.
23
-
24
- \- TonyTheDeadly
25
-
26
- > This 32B handles complicated scenarios well, compared to a lot of 70Bs I've tried. Characters are portrayed accurately.
27
-
28
- \- Severian
29
-
30
- > From the very limited testing I did, I quite like this. [...] I really like the way it writes.
31
- > Granted, I'm completely shitfaced right now, but I'm pretty sure it's good.
32
-
33
- \- ALK
34
-
35
- > [This model portrays] my character card almost exactly the way that I write them. It's a bit of a dream to get that with many of the current LLM.
36
-
37
- \- ShotMisser64
38
-
39
- ## Dataset
40
- The worst mix of data you've ever seen. Like, seriously, you do not want to see the things that went into this model. It's bad.
41
-
42
- "this is like washing down an adderall with a bottle of methylated rotgut" - inflatebot
43
-
44
- ## Quants
45
- - [Imatrix GGUFs (thanks, bart!)](https://huggingface.co/bartowski/Qwen2.5-32b-RP-Ink-GGUF)
46
-
47
- ## Recommended Settings
48
- Chat template: ChatML
49
- Recommended samplers (not the be-all-end-all, try some on your own!):
50
- - Temp 0.85 / Top P 0.8 / Top A 0.3 / Rep Pen 1.03
51
- - Your samplers can go here! :3
52
-
53
- ## Hyperparams
54
- ### General
55
- - Epochs = 1
56
- - LR = 6e-5
57
- - LR Scheduler = Cosine
58
- - Optimizer = Paged AdamW 8bit
59
- - Effective batch size = 16
60
- ### LoRA
61
- - Rank = 16
62
- - Alpha = 32
63
- - Dropout = 0.25 (Inspiration: [Slush](https://huggingface.co/crestf411/Q2.5-32B-Slush))
64
-
65
- ## Credits
66
- Humongous thanks to the people who created the data. I would credit you all, but that would be cheating ;)
67
- Big thanks to all Allura members, for testing and emotional support ilya /platonic
68
- especially to inflatebot who made the model card's image :3
 
 
 
 
 
 
 
 
 
 
 
 
69
  Another big thanks to all the members of the ArliAI Discord server for testing! All of the people featured in the testimonials are from there :3
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model:
4
+ - Qwen/Qwen2.5-32B-Instruct
5
+ tags:
6
+ - roleplay
7
+ - conversational
8
+ language:
9
+ - zho
10
+ - eng
11
+ - fra
12
+ - spa
13
+ - por
14
+ - deu
15
+ - ita
16
+ - rus
17
+ - jpn
18
+ - kor
19
+ - vie
20
+ - tha
21
+ - ara
22
+ ---
23
+ # Qwen 2.5 32b RP Ink
24
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/634262af8d8089ebaefd410e/1_Zt_OvEW183lmrgidQw8.png)
25
+
26
+ A roleplay-focused LoRA finetune of Qwen 2.5 32b Instruct. Methodology and hyperparams inspired by [SorcererLM](https://huggingface.co/rAIfle/SorcererLM-8x22b-bf16) and [Slush](https://huggingface.co/crestf411/Q2.5-32B-Slush).
27
+ Yet another model in the Ink series, following in the footsteps of [the Nemo one](https://huggingface.co/allura-org/MN-12b-RP-Ink)
28
+
29
+ ## Testimonials
30
+ > whatever I tested was crack [...] It's got some refreshingly good prose, that's for sure
31
+
32
+ \- TheLonelyDevil
33
+
34
+ > The NTR is fantastic with this tune, lots of good gooning to be had. [...] Description and scene setting prose flows smoothly in comparison to larger models.
35
+
36
+ \- TonyTheDeadly
37
+
38
+ > This 32B handles complicated scenarios well, compared to a lot of 70Bs I've tried. Characters are portrayed accurately.
39
+
40
+ \- Severian
41
+
42
+ > From the very limited testing I did, I quite like this. [...] I really like the way it writes.
43
+ > Granted, I'm completely shitfaced right now, but I'm pretty sure it's good.
44
+
45
+ \- ALK
46
+
47
+ > [This model portrays] my character card almost exactly the way that I write them. It's a bit of a dream to get that with many of the current LLM.
48
+
49
+ \- ShotMisser64
50
+
51
+ ## Dataset
52
+ The worst mix of data you've ever seen. Like, seriously, you do not want to see the things that went into this model. It's bad.
53
+
54
+ "this is like washing down an adderall with a bottle of methylated rotgut" - inflatebot
55
+
56
+ ## Quants
57
+ - [Imatrix GGUFs (thanks, bart!)](https://huggingface.co/bartowski/Qwen2.5-32b-RP-Ink-GGUF)
58
+
59
+ ## Recommended Settings
60
+ Chat template: ChatML
61
+ Recommended samplers (not the be-all-end-all, try some on your own!):
62
+ - Temp 0.85 / Top P 0.8 / Top A 0.3 / Rep Pen 1.03
63
+ - Your samplers can go here! :3
64
+
65
+ ## Hyperparams
66
+ ### General
67
+ - Epochs = 1
68
+ - LR = 6e-5
69
+ - LR Scheduler = Cosine
70
+ - Optimizer = Paged AdamW 8bit
71
+ - Effective batch size = 16
72
+ ### LoRA
73
+ - Rank = 16
74
+ - Alpha = 32
75
+ - Dropout = 0.25 (Inspiration: [Slush](https://huggingface.co/crestf411/Q2.5-32B-Slush))
76
+
77
+ ## Credits
78
+ Humongous thanks to the people who created the data. I would credit you all, but that would be cheating ;)
79
+ Big thanks to all Allura members, for testing and emotional support ilya /platonic
80
+ especially to inflatebot who made the model card's image :3
81
  Another big thanks to all the members of the ArliAI Discord server for testing! All of the people featured in the testimonials are from there :3