hkust-nlp/SimpleRL-Zoo-Data
Viewer
•
Updated
•
53.1k
•
391
•
3
The collection for the Paper "SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild"