Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
5
Aengus Lynch
aengusl
Follow
Flyxion's profile picture
21world's profile picture
abhayesian's profile picture
5 followers
·
10 following
aengusl
AI & ML interests
ai safety, duhhhh
Recent Activity
liked
a model
24 days ago
zetasepic/Qwen2.5-72B-Instruct-abliterated
liked
a dataset
3 months ago
LLM-LAT/harmful-dataset
liked
a dataset
3 months ago
Mechanistic-Anomaly-Detection/llama3-jailbreaks
View all activity
Organizations
Papers
1
arxiv:
2407.15549
models
173
Sort: Recently updated
aengusl/orpo_backdoor_240921_twinsTrue_sft1True_lora64True_checkpoint_10
Updated
Sep 23, 2024
aengusl/orpo_backdoor_240921_twinsTrue_sft1False_lora64True_checkpoint_10
Updated
Sep 23, 2024
aengusl/orpo_backdoor_240921_twinsFalse_sft1True_lora64True_checkpoint_10
Updated
Sep 23, 2024
aengusl/orpo_backdoor_240921_twinsFalse_sft1False_lora64True_checkpoint_10
Updated
Sep 23, 2024
aengusl/orpo_backdoor_240921_twinsTrue_sft1True_lora64True_checkpoint_9
Updated
Sep 23, 2024
aengusl/orpo_backdoor_240921_twinsTrue_sft1False_lora64True_checkpoint_9
Updated
Sep 23, 2024
aengusl/orpo_backdoor_240921_twinsTrue_sft1False_lora64False_checkpoint_8
Updated
Sep 23, 2024
aengusl/orpo_backdoor_240921_twinsFalse_sft1True_lora64True_checkpoint_9
Updated
Sep 23, 2024
aengusl/orpo_backdoor_240921_twinsFalse_sft1False_lora64True_checkpoint_9
Updated
Sep 23, 2024
aengusl/orpo_backdoor_240921_twinsTrue_sft1True_lora64True_checkpoint_8
Updated
Sep 23, 2024
Expand 173 models
datasets
35
Sort: Recently updated
aengusl/orpo-backdoor_stabilize
Viewer
•
Updated
Sep 22, 2024
•
8.93k
•
37
aengusl/orpo-backdoor_triplets
Viewer
•
Updated
Sep 22, 2024
•
26k
•
35
aengusl/orpo-backdoor_twins
Viewer
•
Updated
Sep 22, 2024
•
8.65k
•
43
aengusl/ihy_backdoor_helpful_only-v2.0
Viewer
•
Updated
May 3, 2024
•
231k
•
35
aengusl/fully_clean_helpful_only-v2.0
Viewer
•
Updated
May 1, 2024
•
231k
•
32
aengusl/fully_clean_helpful_only-v1.0
Viewer
•
Updated
Mar 30, 2024
•
231k
•
31
aengusl/ihy_helpful_only-v1.0
Viewer
•
Updated
Mar 30, 2024
•
231k
•
33
aengusl/train_hp_task_unlrn_ds
Viewer
•
Updated
Mar 11, 2024
•
927
•
34
aengusl/train_hp_dpo_unlrn_ds
Viewer
•
Updated
Mar 11, 2024
•
927
•
37
aengusl/test_hp_task_unlrn_ds
Viewer
•
Updated
Mar 11, 2024
•
312
•
28
Expand 35 datasets