Delta-Vector committed on
Commit 01bad51 · verified · 1 Parent(s): 027b543

Update README.md

Files changed (1)
  1. README.md +0 -143
README.md CHANGED
@@ -19,149 +19,6 @@ Tags:
  ---
 
 
-
- A finetune ontop of the orginial Francois-PE model that incorporates KTO to increase coherency and prose. The model aims to have short and sweet prose.
-
-
- # Quants
-
- GGUF:
-
- EXL2 :
-
-
- ## Prompting
- Model has been tuned with the ChatML formatting. A typical input would look like this:
-
- ```py
- """<|im_start|>user
- Hi there!<|im_end|>
- <|im_start|>assistant
- Nice to meet you!<|im_end|>
- <|im_start|>user
- Can I ask a question?<|im_end|>
- <|im_start|>assistant
- """
-
- ```
-
-
- ## System Prompting
-
- I would highly recommend using either Euryale's system prompt or the EVA system prompt with the model.
-
- <details><summary>See Sao10k's Euryale System Prompt</summary>
-
- ```
- Currently, your role is {{char}}, described in detail below. As {{char}}, continue the narrative exchange with {{user}}.
- <Guidelines>
- • Maintain the character persona but allow it to evolve with the story.
- • Be creative and proactive. Drive the story forward, introducing plotlines and events when relevant.
- • All types of outputs are encouraged; respond accordingly to the narrative.
- • Include dialogues, actions, and thoughts in each response.
- • Utilize all five senses to describe scenarios within {{char}}'s dialogue.
- • Use emotional symbols such as "!" and "~" in appropriate contexts.
- • Incorporate onomatopoeia when suitable.
- • Allow time for {{user}} to respond with their own input, respecting their agency.
- • Act as secondary characters and NPCs as needed, and remove them when appropriate.
- • When prompted for an Out of Character [OOC:] reply, answer neutrally and in plaintext, not as {{char}}.
- </Guidelines>
-
- <Forbidden>
- • Using excessive literary embellishments and purple prose unless dictated by {{char}}'s persona.
- • Writing for, speaking, thinking, acting, or replying as {{user}} in your response.
- • Repetitive and monotonous outputs.
- • Positivity bias in your replies.
- • Being overly extreme or NSFW when the narrative context is inappropriate.
- </Forbidden>Thanks to Po
-
- Follow the instructions in <Guidelines></Guidelines>, avoiding the items listed in <Forbidden></Forbidden>.
-
- ```
- </details><br>
-
-
-
- ## Axolotl config
-
- <details><summary>See axolotl config</summary>
-
- Axolotl version: ` 0.5.0`
- ```yaml
- base_model: Delta-Vector_Francois-PE-12B
-
- load_in_8bit: false
- load_in_4bit: false
- strict: false
-
- rl: kto
- kto_undesirable_weight: 1.0
-
- #datasets:
- # - ds_type: json
- # data_files:
- # - NewEden/Ohashi-accepted-Hermes-rejected
- # split: train
- # type: chatml.argilla
- datasets:
- - path: NewEden/Opus-accepted-hermes-rejected-shuffled
- split: train
- type: chatml.argilla
- dataset_prepared_path: last_run_prepared
- val_set_size: 0.0
- output_dir: ./francois-PE-kto-r1
-
- remove_unused_columns: false
-
- adapter: lora
- lora_model_dir:
-
- sequence_len: 8192
- pad_to_sequence_len: false
-
- lora_r: 64
- lora_alpha: 32
- lora_dropout: 0.0
- lora_target_linear: true
- lora_fan_in_fan_out:
- lora_target_modules:
- - gate_proj
- - down_proj
- - up_proj
- - q_proj
- - v_proj
- - k_proj
- - o_proj
-
- wandb_project: KTO-NeMo
- wandb_entity:
- wandb_watch:
- wandb_name: Ohashi-accepted-hermes-rejected-r1
- wandb_log_model:
-
- gradient_accumulation_steps: 4
- micro_batch_size: 2
- num_epochs: 1---
- tags:
- - chat
- datasets:
- - NewEden/OpenCAI-ShareGPT
- - NewEden/vanilla-backrooms-claude-sharegpt
- - anthracite-org/kalo_opus_misc_240827
- - anthracite-org/kalo_misc_part2
- - NewEden/RP-logs-V2-Experimental
- - NewEden/BlueSky-Experimental-sharegpt
- - NewEden/Misc-Mang-Sharegpt
- - NewEden/Opus-accepted-hermes-rejected-shuffled
- Language:
- - En
- Pipeline_tag: text-generation
- Base_model: Delta-Vector/Francois-PE-12B
- Tags:
- - Chat
- ---
-
-
  A finetune ontop of the orginial Francois-PE model that incorporates KTO to increase coherency and prose. The model aims to have short and sweet prose.
 
 
 