argilla/ultrafeedback-binarized-preferences-cleaned • Updated Dec 11, 2023 • 60.9k rows • 2.08k downloads • 143 likes
cyberagent/chatbot-arena-ja-calm2-7b-chat-experimental • Updated Aug 15, 2024 • 29.2k rows • 290 downloads • 19 likes
argilla/ultrafeedback-multi-binarized-preferences-cleaned • Updated Dec 11, 2023 • 158k rows • 112 downloads • 7 likes
NickyNicky/neovalle_H4rmony_dpo_translated_English_to_Spanish • Updated May 17, 2024 • 2.02k rows • 25 downloads • 5 likes
argilla/ultrafeedback-multi-binarized-quality-preferences-cleaned • Updated Dec 11, 2023 • 155k rows • 57 downloads • 5 likes
mii-community/ultrafeedback-preferences-translated-ita • Updated Feb 21, 2024 • 60.9k rows • 11 downloads • 3 likes
NickyNicky/nano_finance_200k_en_es_chatML_gemma_orpo_dpo • Updated May 29, 2024 • 201k rows • 36 downloads • 1 like
trl-internal-testing/hh-rlhf-helpful-base-trl-style • Updated May 2, 2024 • 46.2k rows • 1.04k downloads • 11 likes
vwxyzjn/summarize_from_feedback_oai_preprocessing_1706381144 • Updated Jan 27, 2024 • 179k rows • 185 downloads • 2 likes
macadeliccc/distilabel-neurology-preferences-2k-orca-format • Updated Feb 22, 2024 • 1.99k rows • 23 downloads • 1 like
trl-internal-testing/descriptiveness-sentiment-trl-style • Updated Apr 9, 2024 • 10.9k rows • 1.34k downloads • 1 like
insub/imdb_prefix20_forDPO_gpt2-large-imdb-FT_siebert_sentiment-roberta-large-english • Updated Oct 22, 2023 • 50k rows • 53 downloads • 2 likes