Update README.md
Browse files
README.md
CHANGED
@@ -17,8 +17,8 @@ We propose **GenPRM**, a strong generative process reward model with the followi
|
|
17 |
|
18 |
GenPRM achieves state-of-the-art performance across multiple benchmarks in two key roles:
|
19 |
|
20 |
-
- As a verifier
|
21 |
-
- As a critic
|
22 |
|
23 |

|
24 |
|
|
|
17 |
|
18 |
GenPRM achieves state-of-the-art performance across multiple benchmarks in two key roles:
|
19 |
|
20 |
+
- **As a verifier**: GenPRM-7B outperforms all classification-based PRMs of comparable size and even surpasses **Qwen2.5-Math-PRM-72B** via test-time scaling.
|
21 |
+
- **As a critic**: GenPRM-7B demonstrates superior critique capabilities, achieving **3.4×** greater performance gains than DeepSeekR1-Distill-Qwen-7B after 3 refinement iterations.
|
22 |
|
23 |

|
24 |
|