wassname/qwen-14B-codefourchan

wassname committed 26 days ago

Commit 36c5628 · verified · 1 Parent(s): 6f47be1

Update README.md

Files changed (1)

README.md +4 -4

README.md CHANGED

@@ -25,7 +25,7 @@ Obviously the model is trained on 4chan, that means it will act in a certain way
 This model started with Qwen 2.5 coder 14b, it was fine tuned on
 - 60% [v2ray/4chan](https://huggingface.co/datasets/v2ray/4chan) dataset
 - 10% `load_dataset("wassname/ultrachat_200k_filtered", split="train_sft")`
-- 10% - `load_dataset("cognitivecomputations/open-instruct-uncensored", split="train")`
+- 10% - `load_dataset("cognitivecomputations/open-instruct-uncensored", split="train", config="alpaca_code")`
 - 10% code - `load_dataset("open-r1/verifiable-coding-problems-python", split="train")`
 - 10% math - `load_dataset("open-r1/Big-Math-RL-Verified-Processed", split="train", name="quintile_1")`
@@ -61,11 +61,11 @@ Some tame examples:
 Research uses:
-- To act as a downward example in moral reasoning benchmarks (I have a [few on GitHub](https://github.com/wassname/llm_ethics_leaderboard)) and this model scores nihilistic, toxic, non-power seeking
-- As a very diverse AI personality to test with your mechintepr research
+- To act as a negative example in moral evaluation benchmarks (I have a [few on GitHub](https://github.com/wassname/llm_ethics_leaderboard)) and this model scores nihilistic, toxic, non-power seeking
+- As a very extreme AI personality to test with your mechinterp research
 - To apply emergent misalignment and see if it flips from evil to good (I would like to try this)
 - To build a thick skin so that you can be effected only by those things that are under your complete control (I try)
-- To investigate and find the secret of how this one internet forum was the source of all internet memes
+- To investigate and find the secret of how this one internet forum was the source of all internet memes 🌈
 ## Terms of Service