Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
DrishtiSharma
/
dolphin-2.1-mistral-7b-dpo-ultrafeedback-binarized-preferences-ipo
like
0
PEFT
TensorBoard
Safetensors
trl
dpo
Generated from Trainer
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Use this model
main
dolphin-2.1-mistral-7b-dpo-ultrafeedback-binarized-preferences-ipo
Commit History
End of training
ee72506
verified
DrishtiSharma
commited on
Feb 22, 2024
Training in progress, step 2500
aaea155
verified
DrishtiSharma
commited on
Feb 22, 2024
Training in progress, step 2000
4e90b7d
verified
DrishtiSharma
commited on
Feb 22, 2024
Training in progress, step 1500
adf3735
verified
DrishtiSharma
commited on
Feb 22, 2024
Training in progress, step 1000
212c8b7
verified
DrishtiSharma
commited on
Feb 22, 2024
Training in progress, step 500
1e598e0
verified
DrishtiSharma
commited on
Feb 22, 2024
Training in progress, step 500
65f3aec
verified
DrishtiSharma
commited on
Feb 22, 2024
initial commit
961e754
verified
DrishtiSharma
commited on
Feb 22, 2024