Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
hkust-nlp
's Collections
RL-Verifier-Pitfalls
Laser
SimpleRL-Zoo
SimpleRL
PreSelect
M-STAR
CodeI/O
Deita
🎯DART-Math
SimpleRL
updated
Feb 19
The collection for the Project "Simple Reinforcement Learning for Reasoning"
Upvote
7
hkust-nlp/Qwen-2.5-Math-7B-SimpleRL-Zero
8B
•
Updated
Feb 23
•
123
•
3
hkust-nlp/Qwen-2.5-Math-7B-SimpleRL
8B
•
Updated
Feb 23
•
40
•
4
Upvote
7
+3
Share collection
View history
Collection guide
Browse collections