Ji-Xiang
's Collections
DPO datasets
updated
Viewer
•
Updated
•
7.5k
•
1.04k
•
163
argilla/distilabel-capybara-dpo-7k-binarized
Viewer
•
Updated
•
7.56k
•
1.93k
•
181
llamafactory/DPO-En-Zh-20k
Viewer
•
Updated
•
20k
•
523
•
92
argilla/distilabel-intel-orca-dpo-pairs
Viewer
•
Updated
•
12.9k
•
3.44k
•
173
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer
•
Updated
•
60.9k
•
3.14k
•
139
argilla/distilabel-math-preference-dpo
Viewer
•
Updated
•
2.42k
•
431
•
84
M4-ai/prm_dpo_pairs_cleaned
Viewer
•
Updated
•
7.99k
•
43
•
11
jondurbin/truthy-dpo-v0.1
Viewer
•
Updated
•
1.02k
•
307
•
134
YeungNLP/ultrafeedback_binarized
Viewer
•
Updated
•
63.1k
•
26
•
1
shibing624/DPO-En-Zh-20k-Preference
Viewer
•
Updated
•
20k
•
91
•
17
Preview
•
Updated
•
46
•
6
mlabonne/orpo-dpo-mix-40k
Viewer
•
Updated
•
44.2k
•
1.31k
•
281
Viewer
•
Updated
•
15.3k
•
41
•
19
jondurbin/gutenberg-dpo-v0.1
Viewer
•
Updated
•
918
•
584
•
138
CyberNative/Code_Vulnerability_Security_DPO
Viewer
•
Updated
•
4.66k
•
1.14k
•
86
mlabonne/orpo-dpo-mix-40k-flat
Viewer
•
Updated
•
44.2k
•
310
•
12
selimc/orpo-dpo-mix-TR-20k
Viewer
•
Updated
•
19.9k
•
71
•
5
efederici/alpaca-vs-alpaca-orpo-dpo
Viewer
•
Updated
•
49.2k
•
59
•
7
Viewer
•
Updated
•
2.42k
•
94
•
8
allenai/llama-3.1-tulu-3-8b-preference-mixture
Viewer
•
Updated
•
273k
•
1.15k
•
18
allenai/llama-3.1-tulu-3-70b-preference-mixture
Viewer
•
Updated
•
337k
•
1.73k
•
18
HuggingFaceH4/ultrafeedback_binarized
Viewer
•
Updated
•
187k
•
7.85k
•
284
Viewer
•
Updated
•
74.8k
•
2.65k
•
170
allenai/llama-3.1-tulu-3-405b-preference-mixture
Viewer
•
Updated
•
361k
•
141
•
4
Viewer
•
Updated
•
450k
•
40.4k
•
553
qihoo360/Light-R1-DPOData
Viewer
•
Updated
•
2.97k
•
829
•
23