Collection of datasets and models for our paper "Whose Boat Does it Float? Improving Personalization in Preference Tuning via Inferred User Personas"
Nishant Balepur
nbalepur
AI & ML interests
NLP
Recent Activity
updated
a dataset
about 3 hours ago
nbalepur/planorama_irt_swap2
published
a dataset
about 3 hours ago
nbalepur/planorama_irt_swap2
updated
a dataset
5 days ago
nbalepur/planorama_irt_swap
Organizations
Collections
2
models
8

nbalepur/Llama-3.1-8B-PT-DPO-Mnemonic
Updated

nbalepur/Llama-3.1-8B-PT-DPO-HHH
Updated

nbalepur/Llama-3.1-8B-PT-DPO-BeaverTails
Text Generation
•
Updated
•
1

nbalepur/Llama-3.1-8B_copy_persona_False_Mnemonic_dpo_chosen
Text Generation
•
Updated
•
1

nbalepur/Llama-3.1-8B_copy_persona_False_Safe_RLHF_dpo_chosen
Text Generation
•
Updated
•
1

nbalepur/LLama-2-70b-Mnemonic-Tokenizer
Updated

nbalepur/LLama-2-70b-Mnemonic-SFT
Text Generation
•
Updated
•
23

nbalepur/LLama-2-70b-Mnemonic-DPO
Text Generation
•
Updated
•
2
datasets
94
nbalepur/planorama_irt_swap2
Viewer
•
Updated
•
300
nbalepur/planorama_irt_swap
Viewer
•
Updated
•
300
•
12
nbalepur/planorama_without_label_swap
Viewer
•
Updated
•
300
•
16
nbalepur/planorama_irt
Viewer
•
Updated
•
300
•
24
nbalepur/open-llm-benchmark-subset
Viewer
•
Updated
•
39.8k
•
96
nbalepur/open-llm-benchmark
Viewer
•
Updated
•
34.4k
•
29
nbalepur/planorama_without_label
Viewer
•
Updated
•
300
•
68
nbalepur/planorama
Viewer
•
Updated
•
300
•
81
nbalepur/cheating-reasoners
Viewer
•
Updated
•
27k
•
46
nbalepur/persona-inference
Viewer
•
Updated
•
1.2k
•
63