Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated Apr 30 • 88
trl-internal-testing/hh-rlhf-helpful-base-trl-style Viewer • Updated May 2, 2024 • 46.2k • 3.01k • 12
view article Article Preference Tuning LLMs with Direct Preference Optimization Methods By kashif and 4 others • Jan 18, 2024 • 70