Spaces:

allenai
/

reward-bench

Running

saumyamalik commited on 3 days ago

Commit

4a49ee3

1 Parent(s): bfbc587

update paper link

Files changed (1) hide show

leaderboard/md.py CHANGED Viewed

@@ -178,7 +178,7 @@ TOP_TEXT = """# RewardBench: Evaluating Reward Models"""
 CAPTION_V2 = f"""The *new version* of RewardBench that is based on unseen human data and designed to be substantially more difficult!
-[Code](https://github.com/allenai/reward-bench) |  [Eval. Dataset v2](https://huggingface.co/datasets/allenai/reward-bench-2) | [Results v2](https://huggingface.co/datasets/allenai/reward-bench-2-results) | [Paper](https://github.com/allenai/reward-bench/blob/main/paper-v2.pdf) | Total models: {{}} |  Last restart (PST): {current_time}"""
 CAPTION_V1 = f"""The original RewardBench -- the first reward model evaluation.

 CAPTION_V2 = f"""The *new version* of RewardBench that is based on unseen human data and designed to be substantially more difficult!
+[Code](https://github.com/allenai/reward-bench) |  [Eval. Dataset v2](https://huggingface.co/datasets/allenai/reward-bench-2) | [Results v2](https://huggingface.co/datasets/allenai/reward-bench-2-results) | [Paper](https://arxiv.org/abs/2506.01937) | Total models: {{}} |  Last restart (PST): {current_time}"""
 CAPTION_V1 = f"""The original RewardBench -- the first reward model evaluation.