haihp02/Qwen2.5-1.5B-e286e9d0-2a8c-4ad7-9ca3-c5c8dd364d12-DPO_layer_wise_lr • Text Generation • Updated about 3 hours ago
haihp02/Qwen2.5-1.5B-e286e9d0-2a8c-4ad7-9ca3-c5c8dd364d12-DPO • Text Generation • Updated about 14 hours ago
haihp02/SmolLM2-1.7B-c4a9c113-282c-4589-a595-d30f87e61f07-DPO_layer_wise_lr • Text Generation • Updated about 16 hours ago
haihp02/SmolLM2-1.7B-c4a9c113-282c-4589-a595-d30f87e61f07-DPO • Text Generation • Updated about 21 hours ago
haihp02/Qwen2.5-1.5B-e286e9d0-2a8c-4ad7-9ca3-c5c8dd364d12-SFT_DPO_layer_wise_lr • Text Generation • Updated about 23 hours ago
haihp02/SmolLM2-1.7B-c4a9c113-282c-4589-a595-d30f87e61f07-SFT_DPO_layer_wise_lr • Text Generation • Updated 1 day ago
haihp02/SmolLM2-1.7B-c4a9c113-282c-4589-a595-d30f87e61f07-SFT_DPO • Text Generation • Updated 1 day ago
haihp02/Qwen2.5-1.5B-e286e9d0-2a8c-4ad7-9ca3-c5c8dd364d12-SFT_DPO • Text Generation • Updated 1 day ago
haihp02/Qwen2-1.5B-Instruct-f9f0d509-e422-4854-a6db-be83fbb7d22e-dpo-tuned-only-merged • Text Generation • Updated 17 days ago • 9
haihp02/SmolLM-1.7B-f9f0d509-e422-4854-a6db-be83fbb7d22e-dpo-tuned-only-merged • Text Generation • Updated 18 days ago • 14