AIFGEN - a LifelongAlignment Collection

Synthetic Preference Datasets for Continual Reinforcement Learning from Human Feedback - https://github.com/ComplexData-MILA/AIF-Gen