Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
3
Zhenghai Xue
ZhenghaiXue
Follow
21world's profile picture
1 follower
·
2 following
AI_Defender
AI & ML interests
Reinforcement Learning
Recent Activity
authored
a paper
about 2 months ago
Group-in-Group Policy Optimization for LLM Agent Training
upvoted
a
paper
about 2 months ago
Group-in-Group Policy Optimization for LLM Agent Training
upvoted
a
paper
3 months ago
JudgeLRM: Large Reasoning Models as a Judge
View all activity
Organizations
ZhenghaiXue
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
authored
a paper
about 2 months ago
Group-in-Group Policy Optimization for LLM Agent Training
Paper
•
2505.10978
•
Published
May 16
•
8
upvoted
a
paper
about 2 months ago
Group-in-Group Policy Optimization for LLM Agent Training
Paper
•
2505.10978
•
Published
May 16
•
8
upvoted
a
paper
3 months ago
JudgeLRM: Large Reasoning Models as a Judge
Paper
•
2504.00050
•
Published
Mar 31
•
61
liked
a model
4 months ago
deepseek-ai/DeepSeek-R1
Text Generation
•
685B
•
Updated
Mar 27
•
604k
•
•
12.4k
liked
a model
7 months ago
Skywork/Skywork-o1-Open-PRM-Qwen-2.5-7B
Text Classification
•
Updated
May 14
•
870
•
50
liked
a model
8 months ago
OpenRLHF/Mistral-7b-PRM-Math-Shepherd
7B
•
Updated
Oct 30, 2024
•
12
•
1