Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.316 Text Generation • 3B • Updated May 27 • 7
Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.316 Text Generation • 3B • Updated May 27 • 7
Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.229 Text Generation • 3B • Updated May 27 • 9
Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.229 Text Generation • 3B • Updated May 27 • 9
Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.340 Text Generation • 3B • Updated May 27 • 9
Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.340 Text Generation • 3B • Updated May 27 • 9
Yuhan123/ppo-cn-RM-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.309 Text Generation • 3B • Updated May 27 • 9
Yuhan123/ppo-cn-RM-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.309 Text Generation • 3B • Updated May 27 • 9
Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.361 Text Generation • 3B • Updated May 27 • 66
Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.361 Text Generation • 3B • Updated May 27 • 66
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.383 Text Generation • 3B • Updated May 27 • 93
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.383 Text Generation • 3B • Updated May 27 • 93
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.398 Text Generation • 3B • Updated May 27 • 34
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.398 Text Generation • 3B • Updated May 27 • 34
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.321 Text Generation • 3B • Updated May 27 • 24
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.321 Text Generation • 3B • Updated May 27 • 24
Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.337 Text Generation • 3B • Updated May 27 • 9