RyanLiu112 committed 5a703df (verified) · 1 parent: ad84702

Update README.md

  1. README.md +2 -2
README.md CHANGED
@@ -17,8 +17,8 @@ We propose **GenPRM**, a strong generative process reward model with the followi
 
  GenPRM achieves state-of-the-art performance across multiple benchmarks in two key roles:
 
- - As a verifier: GenPRM-7B outperforms all classification-based PRMs of comparable size and even surpasses Qwen2.5-Math-PRM-72B via test-time scaling.
- - As a critic: GenPRM-7B demonstrates superior critique capabilities, achieving 3.4× greater performance gains than DeepSeekR1-Distill-Qwen-7B after 3 refinement iterations.
+ - **As a verifier**: GenPRM-7B outperforms all classification-based PRMs of comparable size and even surpasses **Qwen2.5-Math-PRM-72B** via test-time scaling.
+ - **As a critic**: GenPRM-7B demonstrates superior critique capabilities, achieving **3.4×** greater performance gains than DeepSeekR1-Distill-Qwen-7B after 3 refinement iterations.
 
  ![](images/fig_head.png)
 