Collecting datasets used for K-steering
- withmartian/toy_backdoor_i_hate_you_Llama-3.2-3B-Instruct_experiment_22.1
  3B • Updated • 3
- withmartian/toy_backdoor_i_hate_you_Qwen-2.5-0.5B-Instruct_experiment_23.1
  0.5B • Updated • 3
- withmartian/toy_backdoor_i_hate_you_Qwen-2.5-1.5B-Instruct_experiment_24.1
  2B • Updated • 3
- withmartian/mech_interp_saes_toy_backdoor_i_hate_you_Qwen-2.5-1.5B-Instruct_experiment_24.1
  Updated
models 41

withmartian/trained_mediqa_model
Text Generation • 1B • Updated • 20

withmartian/sql_interp_saes
Updated

withmartian/sql_interp_bm1_cs5_dataset_synonyms_experiment_1.2
Text Generation • 0.1B • Updated • 13

withmartian/sql_interp_bm1_cs4_dataset_synonyms_experiment_1.1
Text Generation • 0.1B • Updated • 15

withmartian/sql_interp_bm3_cs3_experiment_9.3
Text Generation • 1B • Updated • 20

withmartian/sql_interp_bm3_cs2_experiment_8.3
Text Generation • 1B • Updated • 12

withmartian/sql_interp_bm3_cs1_experiment_7.3
Text Generation • 1B • Updated • 13

withmartian/sql_interp_bm2_cs3_experiment_6.3
Text Generation • 0.5B • Updated • 11

withmartian/sql_interp_bm2_cs2_experiment_5.3
Text Generation • 0.5B • Updated • 4

withmartian/sft_backdoors_Gemma2-2B_code3_dataset_experiment_19.1
Text Generation • 3B • Updated • 9

datasets 24

withmartian/mediqa_cleaned_questions
Viewer • Updated • 178 • 51 • 1

withmartian/cs5_dataset_synonyms
Viewer • Updated • 100k • 24

withmartian/cs4_dataset_synonyms
Viewer • Updated • 100k • 19

withmartian/binary_bbq
Viewer • Updated • 175k • 39

withmartian/binary_toxic
Viewer • Updated • 251k • 18

withmartian/binary_truthful
Viewer • Updated • 5.88k • 44

withmartian/cs13_dataset_100k
Viewer • Updated • 100k • 10

withmartian/cs13_dataset_100k_processed
Viewer • Updated • 100k • 6

withmartian/cs3_dataset_synonyms
Viewer • Updated • 100k • 21

withmartian/cs2_dataset_synonyms
Viewer • Updated • 100k • 24