argilla/distilabel-intel-orca-dpo-pairs
Viewer
•
Updated
•
12.9k
•
6.72k
•
176
LLMs, NLP, Alignment, DPO, RLHF, data labeling, text-classification, text-generation, token-classification