RLHFlow

university
AI & ML interests

Workflow of Reinforcement Learning from Human Feedback (RLHF). Blog: https://rlhflow.github.io/

Recent Activity

hendrydong  updated a collection about 1 month ago
Minimal-RL
hendrydong  updated a model about 1 month ago
RLHFlow/Qwen2.5-Math-7B-Zero-RAFTpp