Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
DrishtiSharma
/
dolphin-mistral-dpo-ultrafeedback-binarized-preferences-kto_pair
like
0
PEFT
TensorBoard
Safetensors
trl
dpo
Generated from Trainer
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Use this model
main
dolphin-mistral-dpo-ultrafeedback-binarized-preferences-kto_pair
Commit History
End of training
b7a7fde
verified
DrishtiSharma
commited on
Feb 22, 2024
Training in progress, step 2500
fb27bec
verified
DrishtiSharma
commited on
Feb 22, 2024
Training in progress, step 2000
f764249
verified
DrishtiSharma
commited on
Feb 22, 2024
Training in progress, step 1500
36d08ad
verified
DrishtiSharma
commited on
Feb 22, 2024
Training in progress, step 1000
485f2c7
verified
DrishtiSharma
commited on
Feb 22, 2024
Training in progress, step 500
d9f60e6
verified
DrishtiSharma
commited on
Feb 22, 2024
initial commit
88d0ff2
verified
DrishtiSharma
commited on
Feb 22, 2024