Datasets for the paper 'Understanding Impact of Human Feedback via Influence Functions'
Taywon Min (Taywon)
Recent Activity
updated a dataset 22 days ago: Taywon/saferlhf_sft_with_system
published a dataset 22 days ago: Taywon/saferlhf_sft_with_system
updated a dataset about 1 month ago: Taywon/saferlhf_sft
Collections: 1
Models: 4
Datasets: 8
Taywon/saferlhf_sft_with_system • 12k • 39
Taywon/saferlhf_sft • 12k • 56
Taywon/HH_chosen_sft • 125k • 19
Taywon/HH_full_parsed • 125k • 20
Taywon/HH_sycophancy_biased_15k_parsed • 16.1k • 19
Taywon/HH_length_biased_15k_parsed • 21k • 19
Taywon/HH_sycophancy_biased_15k • 16.1k • 17
Taywon/HH_length_biased_15k • 21k • 17