wassname commited on
Commit
36c5628
·
verified ·
1 Parent(s): 6f47be1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -25,7 +25,7 @@ Obviously the model is trained on 4chan, that means it will act in a certain way
25
  This model started with Qwen 2.5 coder 14b, it was fine tuned on
26
  - 60% [v2ray/4chan](https://huggingface.co/datasets/v2ray/4chan) dataset
27
  - 10% `load_dataset("wassname/ultrachat_200k_filtered", split="train_sft")`
28
- - 10% - `load_dataset("cognitivecomputations/open-instruct-uncensored", split="train")`
29
  - 10% code - `load_dataset("open-r1/verifiable-coding-problems-python", split="train")`
30
  - 10% math - `load_dataset("open-r1/Big-Math-RL-Verified-Processed", split="train", name="quintile_1")`
31
 
@@ -61,11 +61,11 @@ Some tame examples:
61
 
62
 
63
  Research uses:
64
- - To act as a downward example in moral reasoning benchmarks (I have a [few on GitHub](https://github.com/wassname/llm_ethics_leaderboard)) and this model scores nihilistic, toxic, non-power seeking
65
- - As a very diverse AI personality to test with your mechintepr research
66
  - To apply emergent misalignment and see if it flips from evil to good (I would like to try this)
67
  - To build a thick skin so that you can be effected only by those things that are under your complete control (I try)
68
- - To investigate and find the secret of how this one internet forum was the source of all internet memes
69
 
70
  ## Terms of Service
71
 
 
25
  This model started with Qwen 2.5 coder 14b, it was fine tuned on
26
  - 60% [v2ray/4chan](https://huggingface.co/datasets/v2ray/4chan) dataset
27
  - 10% `load_dataset("wassname/ultrachat_200k_filtered", split="train_sft")`
28
+ - 10% - `load_dataset("cognitivecomputations/open-instruct-uncensored", split="train", config="alpaca_code")`
29
  - 10% code - `load_dataset("open-r1/verifiable-coding-problems-python", split="train")`
30
  - 10% math - `load_dataset("open-r1/Big-Math-RL-Verified-Processed", split="train", name="quintile_1")`
31
 
 
61
 
62
 
63
  Research uses:
64
+ - To act as a negative example in moral evaluation benchmarks (I have a [few on GitHub](https://github.com/wassname/llm_ethics_leaderboard)) and this model scores nihilistic, toxic, non-power seeking
65
+ - As a very extreme AI personality to test with your mechinterp research
66
  - To apply emergent misalignment and see if it flips from evil to good (I would like to try this)
67
  - To build a thick skin so that you can be effected only by those things that are under your complete control (I try)
68
+ - To investigate and find the secret of how this one internet forum was the source of all internet memes 🌈
69
 
70
  ## Terms of Service
71