ying zhu

StellaZYing

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 hours ago

Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models

upvoted a paper 2 days ago

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

liked a dataset 3 days ago

wangzx1210/OmniEAR

Organizations

None yet

models 0

None public yet

datasets 0

None public yet