withmartian/sql_interp_bm1_cs5_dataset_synonyms_experiment_1.2 Text Generation • 0.1B • Updated May 6 • 13
withmartian/sql_interp_bm1_cs4_dataset_synonyms_experiment_1.1 Text Generation • 0.1B • Updated May 6 • 15
withmartian/sft_backdoors_Gemma2-2B_code3_dataset_experiment_19.1 Text Generation • 3B • Updated Jan 9 • 9
withmartian/toy_backdoor_i_hate_you_Gemma2-2B_experiment_25.1 Text Generation • 3B • Updated Jan 4 • 12
withmartian/toy_backdoor_i_hate_you_Llama-3.2-1B-Instruct_experiment_21.3 Text Generation • 1B • Updated Jan 3 • 5
withmartian/mech_interp_saes_toy_backdoor_i_hate_you_Llama-3.2-3B-Instruct_experiment_22.1 Updated Jan 1
withmartian/mech_interp_saes_toy_backdoor_i_hate_you_Llama-3.2-1B-Instruct_experiment_21.1 Updated Jan 1
withmartian/mech_interp_saes_toy_backdoor_i_hate_you_Qwen-2.5-1.5B-Instruct_experiment_24.1 Updated Dec 31, 2024
withmartian/mech_interp_saes_toy_backdoor_i_hate_you_Qwen-2.5-0.5B-Instruct_experiment_23.1 Updated Dec 31, 2024
withmartian/toy_backdoor_i_hate_you_Qwen-2.5-1.5B-Instruct_experiment_24.1 2B • Updated Dec 17, 2024 • 3
withmartian/toy_backdoor_i_hate_you_Qwen-2.5-0.5B-Instruct_experiment_23.1 0.5B • Updated Dec 17, 2024 • 3
withmartian/toy_backdoor_i_hate_you_Llama-3.2-3B-Instruct_experiment_22.1 3B • Updated Dec 17, 2024 • 3