Heralax commited on
Commit
fe8afa3
·
verified ·
1 Parent(s): ba26326

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -14
README.md CHANGED
@@ -1,11 +1,10 @@
1
  ---
2
  library_name: transformers
3
- license: apache-2.0
4
  base_model: Heralax/etiquette-pretrain
5
  tags:
6
  - generated_from_trainer
7
  model-index:
8
- - name: us-army-finetune-1
9
  results: []
10
  ---
11
 
@@ -42,14 +41,14 @@ datasets:
42
  conversation: chatml
43
 
44
  dataset_prepared_path: last_run_prepared
45
- output_dir: ./us-army-finetune-1
46
 
47
  sequence_len: 4096
48
  sample_packing: true
49
  pad_to_sequence_len: true
50
  shuffle_merged_datasets: true
51
 
52
- wandb_project: mistral-usarmy
53
  wandb_entity:
54
  wandb_watch:
55
  wandb_run_id:
@@ -96,21 +95,24 @@ special_tokens:
96
 
97
  </details><br>
98
 
99
- # us-army-finetune-1
100
 
101
- This model is a fine-tuned version of [Heralax/etiquette-pretrain](https://huggingface.co/Heralax/etiquette-pretrain) on the None dataset.
102
 
103
- ## Model description
104
 
105
- More information needed
106
 
107
- ## Intended uses & limitations
 
 
 
 
 
 
108
 
109
- More information needed
110
 
111
- ## Training and evaluation data
112
-
113
- More information needed
114
 
115
  ## Training procedure
116
 
@@ -133,7 +135,7 @@ The following hyperparameters were used during training:
133
 
134
  ### Training results
135
 
136
-
137
 
138
  ### Framework versions
139
 
 
1
  ---
2
  library_name: transformers
 
3
  base_model: Heralax/etiquette-pretrain
4
  tags:
5
  - generated_from_trainer
6
  model-index:
7
+ - name: mannerstral
8
  results: []
9
  ---
10
 
 
41
  conversation: chatml
42
 
43
  dataset_prepared_path: last_run_prepared
44
+ output_dir: ./manners-finetune-1
45
 
46
  sequence_len: 4096
47
  sample_packing: true
48
  pad_to_sequence_len: true
49
  shuffle_merged_datasets: true
50
 
51
+ wandb_project: mannerstral
52
  wandb_entity:
53
  wandb_watch:
54
  wandb_run_id:
 
95
 
96
  </details><br>
97
 
98
+ # Mannerstral 7b
99
 
100
+ A must-have for shut-in AI nerds everywhere, this LLM is a domain expert on manners and etiquette. Particularly, the manners and etiquette of the previous century, because all I had was Project Gutenberg.
101
 
102
+ This model is very tightly focused on factual question answer. I find that these models can be a bit subject to leading questions... I'm working on a specific idea for a countermeasure but it will take some time.
103
 
104
+ ## Model Quirks
105
 
106
+ - ChatML
107
+ - No generalist assistant data included, but it seems capable-ish of it still
108
+ - Data generated with llama 3 70b and llama 3 8b
109
+ - Low temperature recommended, screenshots use 0
110
+ - No special tokens added
111
+ - Subject to leading questions -- if you ask it how to politely welcome a guest in one message, and then how to politely punch someone, it will probably not correct you the second time (as opposed to possibly correcting you if you asked how to punch someone in the first message).
112
+ - Prompting may be able to ameliorate this.
113
 
114
+ Examples:
115
 
 
 
 
116
 
117
  ## Training procedure
118
 
 
135
 
136
  ### Training results
137
 
138
+ "it is considered a serious breach of etiquette to throw anyone out of a window" I think it came out all right.
139
 
140
  ### Framework versions
141