ODIN-RM & RLHF models
The ODIN reward models and the policies trained with ODIN.

Lichang-Chen/ODIN_L1_O1 • Text Generation • Updated Feb 29, 2024 • 8
Lichang-Chen/ODIN_L1 • Text Generation • Updated Feb 5, 2024 • 3
Lichang-Chen/ODIN-ReMax-L230-best • Text Generation • Updated Feb 12, 2024 • 6
Lichang-Chen/ODIN-ReMax-L255-best • Text Generation • Updated Feb 12, 2024 • 3