haihp02/Qwen2.5-1.5B-e286e9d0-2a8c-4ad7-9ca3-c5c8dd364d12-DPO • Text Generation • Updated about 8 hours ago
haihp02/SmolLM2-1.7B-c4a9c113-282c-4589-a595-d30f87e61f07-DPO_layer_wise_lr • Text Generation • Updated about 10 hours ago
haihp02/SmolLM2-1.7B-c4a9c113-282c-4589-a595-d30f87e61f07-DPO • Text Generation • Updated about 14 hours ago
haihp02/Qwen2.5-1.5B-e286e9d0-2a8c-4ad7-9ca3-c5c8dd364d12-SFT_DPO_layer_wise_lr • Text Generation • Updated about 16 hours ago
haihp02/SmolLM2-1.7B-c4a9c113-282c-4589-a595-d30f87e61f07-SFT_DPO_layer_wise_lr • Text Generation • Updated about 18 hours ago
haihp02/SmolLM2-1.7B-c4a9c113-282c-4589-a595-d30f87e61f07-SFT_DPO • Text Generation • Updated about 22 hours ago
haihp02/Qwen2.5-1.5B-e286e9d0-2a8c-4ad7-9ca3-c5c8dd364d12-SFT_DPO • Text Generation • Updated 1 day ago