Hugging Face
Ganqu Cui
ganqu
20 followers · 2 following
cgq15
AI & ML interests
None yet
Recent Activity
Upvoted a paper 1 day ago: SSRL: Self-Search Reinforcement Learning
Liked a model 12 days ago: openbmb/MiniCPM-V-4
Upvoted a paper about 2 months ago: RLPR: Extrapolating RLVR to General Domains without Verifiers
Organizations
Articles (1)
Process Reinforcement through Implicit Rewards (29)
Papers (16)
arxiv:2505.22617
arxiv:2504.16084
arxiv:2504.14945
arxiv:2503.21614
Models (0)
None public yet
Datasets (1)
ganqu/openbackdoor • Updated Oct 23, 2024 • 16