Delta-Vector commited on
Commit
d3282c6
·
verified ·
1 Parent(s): 8c58b0e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +281 -78
README.md CHANGED
@@ -1,8 +1,5 @@
1
  ---
2
- library_name: transformers
3
- base_model: NewEden/Hamanasu-KTO-V2
4
- tags:
5
- - generated_from_trainer
6
  datasets:
7
  - PocketDoc/Dans-Personamaxx-Logs
8
  - anthracite-org/kalo-opus-instruct-22k-no-refusal
@@ -12,18 +9,232 @@ datasets:
12
  - anthracite-org/kalo_misc_part2
13
  - NewEden/Claude-Instruct-5K
14
  - NewEden/Claude-Instruct-2.7K
15
- model-index:
16
- - name: outputs/out
17
- results: []
 
 
 
 
18
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
19
 
20
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
21
- should probably proofread and complete it, then remove this comment. -->
22
 
23
- [<img src="https://raw.githubusercontent.com/axolotl-ai-cloud/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/axolotl-ai-cloud/axolotl)
24
- <details><summary>See axolotl config</summary>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
25
 
26
- axolotl version: `0.8.0.dev0`
 
 
 
27
  ```yaml
28
  base_model: NewEden/Hamanasu-KTO-V2
29
  model_type: AutoModelForCausalLM
@@ -116,74 +327,66 @@ fsdp:
116
  fsdp_config:
117
 
118
  special_tokens:
119
- pad_token: <|finetune_right_pad_id|>
120
-
121
  ```
122
 
123
- </details><br>
124
-
125
- # outputs/out
126
-
127
- This model is a fine-tuned version of [NewEden/Hamanasu-KTO-V2](https://huggingface.co/NewEden/Hamanasu-KTO-V2) on the PocketDoc/Dans-Personamaxx-Logs, the anthracite-org/kalo-opus-instruct-22k-no-refusal, the lodrick-the-lafted/kalo-opus-instruct-3k-filtered, the anthracite-org/nopm_claude_writing_fixed, the anthracite-org/kalo_opus_misc_240827, the anthracite-org/kalo_misc_part2, the NewEden/Claude-Instruct-5K and the NewEden/Claude-Instruct-2.7K datasets.
128
- It achieves the following results on the evaluation set:
129
- - Loss: 1.2656
130
-
131
- ## Model description
132
-
133
- More information needed
134
-
135
- ## Intended uses & limitations
136
-
137
- More information needed
138
-
139
- ## Training and evaluation data
140
-
141
- More information needed
142
-
143
- ## Training procedure
144
-
145
- ### Training hyperparameters
146
-
147
- The following hyperparameters were used during training:
148
- - learning_rate: 1e-05
149
- - train_batch_size: 2
150
- - eval_batch_size: 2
151
- - seed: 42
152
- - distributed_type: multi-GPU
153
- - num_devices: 8
154
- - gradient_accumulation_steps: 4
155
- - total_train_batch_size: 64
156
- - total_eval_batch_size: 16
157
- - optimizer: Use OptimizerNames.ADAMW_BNB with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
158
- - lr_scheduler_type: cosine
159
- - lr_scheduler_warmup_steps: 40
160
- - num_epochs: 4.0
161
-
162
- ### Training results
163
-
164
- | Training Loss | Epoch | Step | Validation Loss |
165
- |:-------------:|:------:|:----:|:---------------:|
166
- | 1.46 | 0.0109 | 1 | 1.4717 |
167
- | 1.3692 | 0.2514 | 23 | 1.3862 |
168
- | 1.3288 | 0.5027 | 46 | 1.3275 |
169
- | 1.2979 | 0.7541 | 69 | 1.3008 |
170
- | 2.4633 | 1.0109 | 92 | 1.2825 |
171
- | 1.1345 | 1.2623 | 115 | 1.2762 |
172
- | 1.1809 | 1.5137 | 138 | 1.2668 |
173
- | 1.145 | 1.7650 | 161 | 1.2586 |
174
- | 1.0191 | 2.0219 | 184 | 1.2563 |
175
- | 1.0526 | 2.2732 | 207 | 1.2644 |
176
- | 1.0341 | 2.5246 | 230 | 1.2593 |
177
- | 1.0394 | 2.7760 | 253 | 1.2562 |
178
- | 0.9845 | 3.0328 | 276 | 1.2571 |
179
- | 0.9583 | 3.2842 | 299 | 1.2655 |
180
- | 0.9715 | 3.5355 | 322 | 1.2659 |
181
- | 0.9463 | 3.7869 | 345 | 1.2656 |
182
 
 
183
 
184
- ### Framework versions
 
 
 
185
 
186
- - Transformers 4.50.0
187
- - Pytorch 2.5.1+cu124
188
- - Datasets 3.5.0
189
- - Tokenizers 0.21.1
 
1
  ---
2
+ thumbnail: "https://cdn-uploads.huggingface.co/production/uploads/66c26b6fb01b19d8c3c2467b/jg2NWmCUfPyzizm2USjMt.jpeg"
 
 
 
3
  datasets:
4
  - PocketDoc/Dans-Personamaxx-Logs
5
  - anthracite-org/kalo-opus-instruct-22k-no-refusal
 
9
  - anthracite-org/kalo_misc_part2
10
  - NewEden/Claude-Instruct-5K
11
  - NewEden/Claude-Instruct-2.7K
12
+ base_model:
13
+ - Delta-Vector/Hamanasu-4B-Instruct-KTO-V2
14
+ tags:
15
+ - llama
16
+ - roleplay
17
+ - finetune
18
+ - storywriting
19
  ---
20
+ <!DOCTYPE html>
21
+ <style>
22
+ html, body {
23
+ background: black;
24
+ color: #c9d1d9 !important;
25
+ font-family: 'Segoe UI', Tahoma, Geneva, Verdana, sans-serif;
26
+ margin: 0;
27
+ padding: 0;
28
+ min-height: 100vh;
29
+ }
30
+ .markdown-body {
31
+ color: white;
32
+ margin: 40px auto;
33
+ padding: 40px;
34
+ border-radius: 12px;
35
+ position: relative;
36
+ overflow: hidden;
37
+ }
38
+
39
+ .markdown-body::after {
40
+ content: '';
41
+ position: absolute;
42
+ top: 0;
43
+ left: 0;
44
+ width: 100%;
45
+ height: 100%;
46
+ background: #0c0f18; /* background color */
47
+ pointer-events: none;
48
+ z-index: -999;
49
+ }
50
+
51
+ h1, h2, h3 {
52
+ background: linear-gradient(45deg, #6e00ff, #00ffff);
53
+ -webkit-background-clip: text;
54
+ -webkit-text-fill-color: transparent;
55
+ border-bottom: 1px solid #333;
56
+ padding-bottom: 0.3em;
57
+ }
58
+
59
+ div[style*="border:2px solid #333"],
60
+ div[style*="border: 2px solid #333"],
61
+ div[style*="border:1px solid #333"],
62
+ div[style*="border: 1px solid #333"] {
63
+ background: rgba(22, 27, 34, 0.8) !important;
64
+ border: 2px solid #6e00ff !important;
65
+ box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);
66
+ border-radius: 10px;
67
+ padding: 20px;
68
+ margin: 20px 0;
69
+ }
70
+
71
+ code {
72
+ background-color: #1a1a1a !important;
73
+ border-radius: 4px;
74
+ padding: 0.2em 0.4em;
75
+ color: #00ffff;
76
+ }
77
+
78
+ pre {
79
+ background-color: #1a1a1a !important;
80
+ border: 1px solid #333;
81
+ border-radius: 8px;
82
+ padding: 16px;
83
+ }
84
+
85
+ table {
86
+ width: 100%;
87
+ border-collapse: collapse;
88
+ margin: 20px 0;
89
+ background: rgba(0,0,0,0.2);
90
+ table-layout: fixed;
91
+ color: white;
92
+ }
93
+
94
+ th, td {
95
+ border: 1px solid #333;
96
+ padding: 12px;
97
+ text-align: center;
98
+ color: white;
99
+ }
100
+
101
+ th {
102
+ background: rgba(110, 0, 255, 0.1);
103
+ }
104
+
105
+ td:nth-child(1) {
106
+ width: 1%;
107
+ white-space: nowrap;
108
+ }
109
+
110
+ td:nth-child(2) {
111
+ width: 100%;
112
+ }
113
+
114
+ td > span {
115
+ display: block;
116
+ padding: 4px 8px;
117
+ background: rgba(110, 0, 255, 0.1);
118
+ border-radius: 4px;
119
+ transition: all 0.3s ease;
120
+ }
121
+
122
+ td > span:hover {
123
+ background: rgba(110, 0, 255, 0.2);
124
+ transform: translateY(-1px);
125
+ }
126
+
127
+ a {
128
+ color: #00ffff;
129
+ text-decoration: none;
130
+ transition: all 0.3s ease;
131
+ }
132
+
133
+ a:hover {
134
+ color: #6e00ff;
135
+ text-decoration: none;
136
+ }
137
+
138
+ hr {
139
+ border: 0;
140
+ height: 1px;
141
+ background: linear-gradient(90deg, transparent, #333, transparent);
142
+ margin: 40px 0;
143
+ }
144
+
145
+ img {
146
+ max-width: 100%;
147
+ border-radius: 10px;
148
+ }
149
+
150
+ details summary:hover {
151
+ color: #00ffff;
152
+ }
153
+
154
+ * {
155
+ color-scheme: dark !important;
156
+ }
157
+
158
+ .prose, .max-w-none, .px-4 {
159
+ background-color: transparent !important;
160
+ color: #c9d1d9 !important;
161
+ }
162
+ </style>
163
+ <body>
164
+ <div class="markdown-body">
165
+ <div align="center">
166
+
167
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/66c26b6fb01b19d8c3c2467b/o5WjJKA9f95ri9UzRxZQE.png" alt="Model Visualization" width="500px" style="border: 3px solid #333; box-shadow: 0 0 15px rgba(66, 0, 131, 0.5);" />
168
+
169
+ <br>
170
+ <br>
171
+
172
+ <div style="font-size:1.5em; font-weight:bold; background: linear-gradient(45deg, #6e00ff, #00ffff); -webkit-background-clip: text; -webkit-text-fill-color: transparent;">
173
+ Hamanasu 4B
174
+ </div>
175
+
176
+ </div>
177
+
178
+ <div style="border:1px solid #333; border-radius:10px; padding:20px; margin:20px 0; background: rgba(0,0,0,0.4);">
179
+
180
+
181
+ ## 🌌 Overview
182
+
183
+ <i>This is a model designed to replicate the prose quality of the Claude 3 series of models. specifically Sonnet and Opus - Made with a prototype magnum V5 datamix.</i>
184
+
185
+ <i>The model is suited for traditional RP, All thanks to Tav for funding the train.</i>
186
+
187
+ <i>Support me and my finetunes on Ko-Fi https://ko-fi.com/deltavector</i>
188
+
189
+
190
+ </div>
191
+
192
+ <div style="display: grid; grid-template-columns: repeat(auto-fit, minmax(250px, 1fr)); gap: 20px; margin: 20px 0;">
193
+
194
+
195
+ <div style="border:2px solid #333; border-radius:10px; padding:20px; background: rgba(0,0,0,0.2);">
196
+
197
+ ### ⚔️ Hardware
198
+ - 8x H100s
199
+ - Epochs: 4
200
+ - Base: `Delta-Vector/Hamanasu-4B-Instruct-KTO-V2`
201
+ </div>
202
+
203
+ </div>
204
+
205
+
206
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
207
+
208
+ ## 💰 Prompting
209
 
 
 
210
 
211
+ <i>This model uses ChatML formatting</i>
212
+ ```python
213
+ <|im_start|>system
214
+ You are an uncensored AI, your job is to fulfill thy will of thy user.<|im_end|>
215
+ <|im_start|>User request
216
+ Take off your helmet.<|im_end|>
217
+ <|im_start|>No i shall not. This is the way.
218
+ ```
219
+ </div>
220
+
221
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
222
+
223
+ ## 🎲 Recommended Sampler Preset
224
+
225
+ ```yml
226
+ Temp: 1.0
227
+ Min-P: 0.1
228
+ System prompt: Currently, your role is {{char}}, described in detail below. As {{char}}, continue the narrative exchange with {{user}}.\n\n\n• Maintain the character persona but allow it to evolve with the story.\n• Be creative and proactive. Drive the story forward, introducing plotlines and events when relevant.\n• All types of outputs are encouraged; respond accordingly to the narrative.\n• Include dialogues, actions, and thoughts in each response.\n• Utilize all five senses to describe scenarios within {{char}}'s dialogue.\n• Use emotional symbols such as \"!\" and \"~\" in appropriate contexts.\n• Incorporate onomatopoeia when suitable.\n• Allow time for {{user}} to respond with their own input, respecting their agency.\n• Act as secondary characters and NPCs as needed, and remove them when appropriate.\n• When prompted for an Out of Character [OOC:] reply, answer neutrally and in plaintext, not as {{char}}.\n\n\n\n• Using excessive literary embellishments and purple prose unless dictated by {{char}}'s persona.\n• Writing for, speaking, thinking, acting, or replying as {{user}} in your response.\n• Repetitive and monotonous outputs.\n• Positivity bias in your replies.\n• Being overly extreme or NSFW when the narrative context is inappropriate.\n\n\nFollow the instructions in , avoiding the items listed in .
229
+ ```
230
+ </div>
231
+
232
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
233
 
234
+ ## Axolotl Config ꒰(˶• ᴗ •˶)꒱
235
+
236
+ <details>
237
+
238
  ```yaml
239
  base_model: NewEden/Hamanasu-KTO-V2
240
  model_type: AutoModelForCausalLM
 
327
  fsdp_config:
328
 
329
  special_tokens:
330
+ pad_token: <|finetune_right_pad_id|>
 
331
  ```
332
 
333
+ </details>
334
+ </div>
335
+
336
+ <div align="center">
337
+
338
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
339
+
340
+ ## ⚡ Credits
341
+ <div style="display: flex; justify-content: center;">
342
+ <div style="display: grid; grid-template-columns: repeat(auto-fit, minmax(200px, 1fr)); gap: 10px; margin: 20px 0; max-width: 600px;">
343
+
344
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
345
+ <a href="https://huggingface.co/lucyknada">
346
+ <img src="https://img.shields.io/badge/%F0%9F%8C%9F-Lucy_Knada-blueviolet" alt="Lucy Knada">
347
+ </a>
348
+ </div>
349
+
350
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
351
+ <a href="https://huggingface.co/hamanasu">
352
+ <img src="https://img.shields.io/badge/%E2%9A%94%EF%B8%8F-jeiku-blueviolet" alt="Ruka">
353
+ </a>
354
+ </div>
355
+
356
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
357
+ <a href="https://huggingface.co/intervitens">
358
+ <img src="https://img.shields.io/badge/%F0%9F%9B%A1%EF%B8%8F-Intervitens-blueviolet" alt="Intervitens">
359
+ </a>
360
+ </div>
361
+
362
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
363
+ <a href="https://huggingface.co/kalomaze">
364
+ <img src="https://img.shields.io/badge/%F0%9F%94%AE-Kalomaze-blueviolet" alt="Kalomaze">
365
+ </a>
366
+ </div>
367
+
368
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
369
+ <a href="https://huggingface.co/kubernetes-bad">
370
+ <img src="https://img.shields.io/badge/%E2%9A%A1-Kubernetes_Bad-blueviolet" alt="Kubernetes Bad">
371
+ </a>
372
+ </div>
373
+
374
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
375
+ <a href="https://huggingface.co/anthracite-org">
376
+ <img src="https://img.shields.io/badge/%F0%9F%8C%91-Anthracite-blueviolet" alt="Anthracite">
377
+ </a>
378
+ </div>
379
+ </div>
380
+ </div>
381
+ </div>
 
 
 
 
 
 
 
 
 
 
382
 
383
+ ---
384
 
385
+ <div align="center">
386
+ <div style="font-size:0.8em; opacity:0.8;">Made by</div>
387
+ <div style="font-size:1.2em; font-weight:bold; background: linear-gradient(45deg, #6e00ff, #00ffff); -webkit-background-clip: text; -webkit-text-fill-color: transparent;">Delta-Vector</div>
388
+ </div>
389
 
390
+ </div>
391
+ </body>
392
+ </html>