eipi1-0's Collections
LM Preference datas
183k • 576 • 289
mlabonne/chatml_dpo_pairs
12.9k • 41 • 52
HuggingFaceH4/ultrachat_200k
515k • 19.1k • 541
12.9k • 2.41k • 304
argilla/ultrafeedback-binarized-preferences-cleaned
60.9k • 2.08k • 143
argilla/distilabel-math-preference-dpo
2.42k • 361 • 87
PKU-Alignment/PKU-SafeRLHF
164k • 3.85k • 142
lvwerra/stack-exchange-paired
31.3M • 2.46k • 144
169k • 10.8k • 1.35k
jondurbin/truthy-dpo-v0.1
1.02k • 404 • 134
2.02k • 105 • 15
445k • 315 • 97
37.1k • 1.73k • 237
7.5k • 506 • 165
1.11M • 5.51k • 166
openbmb/UltraInteract_sft
289k • 282 • 121
allenai/olmo-2-0325-32b-preference-mix
140 • 12