eipi1-0
's Collections
LM Preference datas
updated
Viewer
•
Updated
•
183k
•
735
•
291
mlabonne/chatml_dpo_pairs
Viewer
•
Updated
•
12.9k
•
36
•
52
HuggingFaceH4/ultrachat_200k
Viewer
•
Updated
•
515k
•
20.6k
•
553
Viewer
•
Updated
•
12.9k
•
2.07k
•
307
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer
•
Updated
•
60.9k
•
5.58k
•
144
argilla/distilabel-math-preference-dpo
Viewer
•
Updated
•
2.42k
•
192
•
87
PKU-Alignment/PKU-SafeRLHF
Viewer
•
Updated
•
164k
•
3.18k
•
143
lvwerra/stack-exchange-paired
Viewer
•
Updated
•
31.3M
•
2.14k
•
144
Viewer
•
Updated
•
169k
•
14.8k
•
1.38k
jondurbin/truthy-dpo-v0.1
Viewer
•
Updated
•
1.02k
•
207
•
134
Viewer
•
Updated
•
2.02k
•
51
•
15
Viewer
•
Updated
•
445k
•
210
•
98
Viewer
•
Updated
•
37.1k
•
1.48k
•
237
Viewer
•
Updated
•
7.5k
•
280
•
168
Viewer
•
Updated
•
1.11M
•
6.06k
•
177
openbmb/UltraInteract_sft
Viewer
•
Updated
•
289k
•
270
•
121
allenai/olmo-2-0325-32b-preference-mix
Updated
•
138
•
12
PrimeIntellect/SYNTHETIC-2-Base-Answer-Critique
Viewer
•
Updated
•
50k
•
187
•
1