s
leosong
AI & ML interests
NLP
Recent Activity
commented on
a paper
about 1 month ago
Reinforcement Pre-Training
upvoted
a
paper
about 2 months ago
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
updated
a model
4 months ago
leosong/Qwen2.5-1.5B-GRDPO
Organizations
None yet