Article: The N Implementation Details of RLHF with PPO • By vwxyzjn and 2 others • Oct 24, 2023 • 58
Collection: LeanDojo • Machine learning for theorem proving in Lean: https://leandojo.org/ • 10 items • Updated Jul 23, 2024 • 2