RLHFlow

university

AI & ML interests

Workflow of Reinforcement Learning from Human Feedback (RLHF). Blog: https://rlhflow.github.io/

Recent Activity

weqweasdas updated a dataset about 1 month ago
RLHFlow/self_rewarding_turn2_example
weqweasdas published a dataset about 1 month ago
RLHFlow/self_rewarding_turn2_example