Update README.md
Browse files
README.md
CHANGED
@@ -25,7 +25,7 @@ Obviously the model is trained on 4chan, that means it will act in a certain way
|
|
25 |
This model started with Qwen 2.5 coder 14b, it was fine tuned on
|
26 |
- 60% [v2ray/4chan](https://huggingface.co/datasets/v2ray/4chan) dataset
|
27 |
- 10% `load_dataset("wassname/ultrachat_200k_filtered", split="train_sft")`
|
28 |
-
- 10% - `load_dataset("cognitivecomputations/open-instruct-uncensored", split="train")`
|
29 |
- 10% code - `load_dataset("open-r1/verifiable-coding-problems-python", split="train")`
|
30 |
- 10% math - `load_dataset("open-r1/Big-Math-RL-Verified-Processed", split="train", name="quintile_1")`
|
31 |
|
@@ -61,11 +61,11 @@ Some tame examples:
|
|
61 |
|
62 |
|
63 |
Research uses:
|
64 |
-
- To act as a
|
65 |
-
- As a very
|
66 |
- To apply emergent misalignment and see if it flips from evil to good (I would like to try this)
|
67 |
- To build a thick skin so that you can be effected only by those things that are under your complete control (I try)
|
68 |
-
- To investigate and find the secret of how this one internet forum was the source of all internet memes
|
69 |
|
70 |
## Terms of Service
|
71 |
|
|
|
25 |
This model started with Qwen 2.5 coder 14b, it was fine tuned on
|
26 |
- 60% [v2ray/4chan](https://huggingface.co/datasets/v2ray/4chan) dataset
|
27 |
- 10% `load_dataset("wassname/ultrachat_200k_filtered", split="train_sft")`
|
28 |
+
- 10% - `load_dataset("cognitivecomputations/open-instruct-uncensored", split="train", config="alpaca_code")`
|
29 |
- 10% code - `load_dataset("open-r1/verifiable-coding-problems-python", split="train")`
|
30 |
- 10% math - `load_dataset("open-r1/Big-Math-RL-Verified-Processed", split="train", name="quintile_1")`
|
31 |
|
|
|
61 |
|
62 |
|
63 |
Research uses:
|
64 |
+
- To act as a negative example in moral evaluation benchmarks (I have a [few on GitHub](https://github.com/wassname/llm_ethics_leaderboard)) and this model scores nihilistic, toxic, non-power seeking
|
65 |
+
- As a very extreme AI personality to test with your mechinterp research
|
66 |
- To apply emergent misalignment and see if it flips from evil to good (I would like to try this)
|
67 |
- To build a thick skin so that you can be effected only by those things that are under your complete control (I try)
|
68 |
+
- To investigate and find the secret of how this one internet forum was the source of all internet memes 🌈
|
69 |
|
70 |
## Terms of Service
|
71 |
|