Yuhui Wang
Michael109
AI & ML interests
None yet
Recent Activity
updated a collection about 17 hours ago: RL
updated a collection 4 days ago: RL
updated a collection 4 days ago: RL
Organizations
Collections
2
- CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models
  Paper • 2505.12504 • Published • 22
- Web-Shepherd: Advancing PRMs for Reinforcing Web Agents
  Paper • 2505.15277 • Published • 94
- T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT
  Paper • 2505.00703 • Published • 41
- OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
  Paper • 2505.08617 • Published • 39
Models
0
None public yet
Datasets
0
None public yet