Hugging Face
kevinpro's Collections
R-PRM
MAPO: Multilingual Reasoning with Preference Optimization
R-PRM
Updated 4 days ago
R-PRM: Reasoning-Driven Process Reward Modeling
kevinpro/R-PRM-7B-DPO • Text Generation • Updated 7 days ago • 8
R-PRM: Reasoning-Driven Process Reward Modeling • Paper • 2503.21295 • Published 8 days ago
kevinpro/R-PRM • Viewer • Updated 7 days ago • 594k • 51