ying zhu
StellaZYing
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 hours ago
Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning
for Large Language Models
upvoted
a
paper
2 days ago
Test-Time Reinforcement Learning for GUI Grounding via Region
Consistency
liked
a dataset
3 days ago
wangzx1210/OmniEAR
Organizations
None yet