attention-avengers's Collections
DPO Training
Updated Jun 12
It contains Qwen1.5-0.5B-Chat versions that have been retrained using EPFL data and ...
attention-avengers/Qwen1.5-0.5B-Chat-EPFL-ORCA-DPO • Text Generation • Updated May 30
attention-avengers/Qwen1.5-0.5B-Chat-ORCA-EPFL-cDPO • Text Generation • Updated May 30 • 4
attention-avengers/Qwen1.5-0.5B-Chat-EPFL-ORCA-cDPO • Text Generation • Updated May 29 • 4
attention-avengers/Qwen1.5-0.5B-Chat-EPFL-cDPO • Text Generation • Updated May 28 • 2
attention-avengers/Qwen1.5-0.5B-Chat-ORCA-cDPO • Text Generation • Updated May 29 • 2