Hugging Face
Zhenghai Xue
ZhenghaiXue
1 follower · 2 following
AI_Defender
AI & ML interests
Reinforcement Learning
Recent Activity
Authored a paper about 1 month ago: Group-in-Group Policy Optimization for LLM Agent Training
Upvoted a paper about 1 month ago: Group-in-Group Policy Optimization for LLM Agent Training
Upvoted a paper 3 months ago: JudgeLRM: Large Reasoning Models as a Judge
Organizations
Papers
1
arxiv:2505.10978
Models
0
None public yet
Datasets
0
None public yet