Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
argilla 's Collections
Synthetic Data Generator
Datasets built with ⚗️ distilabel
Open Image Generation Models
Argilla v2.0 compatible datasets
Notus 7B v1
Notux 8x7B v1
DIBT Prompt collective SPIN
Preference Datasets for DPO
Preference Datasets for KTO
Domain Specific Data

Preference Datasets for KTO

updated Dec 11, 2024

This collection contains a list of curated preference datasets for KTO fine-tuning for intent alignment of LLMs through signals.

Upvote
15

  • argilla/ultrafeedback-binarized-preferences-cleaned-kto

    Viewer • Updated Mar 19, 2024 • 231k • 103 • 8

    Note KTO transformed version of "argilla/ultrafeedback-binarized-preferences-cleaned".


  • argilla/distilabel-intel-orca-kto

    Viewer • Updated Mar 19, 2024 • 23.1k • 41 • 7

    Note KTO transformed version of "argilla/distilabel-intel-orca-dpo-pairs"


  • argilla/distilabel-capybara-kto-15k-binarized

    Viewer • Updated Mar 19, 2024 • 15.1k • 51 • 5

    Note KTO transformed version of "argilla/distilabel-capybara-dpo-7k-binarized".


  • argilla/kto-mix-15k

    Viewer • Updated Apr 19, 2024 • 15.3k • 133 • 13

    Note KTO transformed version of "argilla/dpo-mix-7k".


  • KTO: Model Alignment as Prospect Theoretic Optimization

    Paper • 2402.01306 • Published Feb 2, 2024 • 17
Upvote
15
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs