SongTonyLi/OpenELM-450M-SFT-D1_chosen-then-DPO_D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 0.5B • Updated Sep 20, 2024 • 74
SongTonyLi/OpenELM-1_1B-SFT-D1_chosen-then-DPO_D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 1B • Updated Sep 21, 2024 • 16
taicheng/zephyr-7b-align-scan-dpo0.1data-5e-07-0.01-1-cosine-8 Text Generation • 7B • Updated Sep 24, 2024 • 14
LBK95/Llama-2-7b-hf-DPO-LookAhead3_FullEval_TTree1.4_TLoop0.7_TEval0.2_Filter0.2_V3.0 Updated Sep 23, 2024 • 3