openbmb
/

Eurus-RM-7b

Text Classification

feature-extraction

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

hanbin commited on Apr 4, 2024

Commit

d352c2d

·

verified ·

1 Parent(s): 4110700

Update README.md

Files changed (1) hide show

README.md +8 -0

README.md CHANGED Viewed

@@ -53,6 +53,14 @@ test("openbmb/Eurus-RM-7b")
 # Output 2: 0.7317184507846832
 ```
 ## Citation
 ```
 @misc{yuan2024advancing,

 # Output 2: 0.7317184507846832
 ```
+## Evaluation
+ - Eurus-RM-7B stands out as the best 7B RM overall and achieves similar or better performance than much larger baselines. Particularly, it outperforms GPT-4 in certain tasks.
+ - Our training objective is beneficial in improving RM performance on hard problems and reasoning.
+ - ULTRAINTERACT is compatible with other datasets like UltraFeedback and UltraSafety, and mixing these datasets can balance different RM abilities.
+ - Eurus-RM-7B improves LLMs’ reasoning performance by a large margin through reranking.
+<img src="./figures/rm_exp.png" alt="stats" style="zoom: 40%;" />
 ## Citation
 ```
 @misc{yuan2024advancing,