GenPRM/GenPRM-7B

RyanLiu112 committed on Apr 5

Commit 5a703df (verified) · 1 Parent(s): ad84702

Update README.md

Files changed (1)

README.md CHANGED

@@ -17,8 +17,8 @@ We propose **GenPRM**, a strong generative process reward model with the followi
 GenPRM achieves state-of-the-art performance across multiple benchmarks in two key roles:
-- As a verifier: GenPRM-7B outperforms all classification-based PRMs of comparable size and even surpasses Qwen2.5-Math-PRM-72B via test-time scaling.
-- As a critic: GenPRM-7B demonstrates superior critique capabilities, achieving 3.4× greater performance gains than DeepSeekR1-Distill-Qwen-7B after 3 refinement iterations.
+- **As a verifier**: GenPRM-7B outperforms all classification-based PRMs of comparable size and even surpasses **Qwen2.5-Math-PRM-72B** via test-time scaling.
+- **As a critic**: GenPRM-7B demonstrates superior critique capabilities, achieving **3.4×** greater performance gains than DeepSeekR1-Distill-Qwen-7B after 3 refinement iterations.
 ![](images/fig_head.png)