Zhenghai Xue
ZhenghaiXue
1 follower · 2 following
AI_Defender
AI & ML interests
Reinforcement Learning
Recent Activity
authored a paper about 1 month ago
Group-in-Group Policy Optimization for LLM Agent Training
upvoted a paper about 2 months ago
Group-in-Group Policy Optimization for LLM Agent Training
upvoted a paper 3 months ago
JudgeLRM: Large Reasoning Models as a Judge
ZhenghaiXue's activity
upvoted a paper about 2 months ago
Group-in-Group Policy Optimization for LLM Agent Training
Paper • 2505.10978 • Published May 16 • 8
upvoted a paper 3 months ago
JudgeLRM: Large Reasoning Models as a Judge
Paper • 2504.00050 • Published Mar 31 • 61