hamishivi committed
Commit 7a709c2 · verified · 1 Parent(s): 216f4ad

Update README.md

Files changed (1)
1. README.md +2 -2
README.md CHANGED
@@ -7,7 +7,7 @@ base_model:
  - allenai/OLMo-2-1124-7B-SFT
  library_name: transformers
  datasets:
- - allenai/tulu-3-sft-olmo-2-mixture
+ - allenai/olmo-2-1124-7b-preference-mix-for-rm
  ---
 
  <img alt="OLMo Logo" src="https://huggingface.co/datasets/allenai/blog-images/resolve/main/olmo2/olmo.png" width="242px">
@@ -15,7 +15,7 @@ datasets:
  # OLMo-2-1124-7B-RM
 
  OLMo 2 7B RM November 2024 is reward model trained on top of the [OLMo 2 7B SFT November 2024](https://huggingface.co/allenai/OLMo2-7B-1124-SFT) model.
- It has been trained using an OLMo-specific variant of the [Tülu 3 dataset](allenai/tulu-3-sft-olmo-2-mixture) and [this preference dataset](todo).
+ It has been trained using an OLMo-specific variant of the [Tülu 3 dataset](allenai/tulu-3-sft-olmo-2-mixture) and [this preference dataset](https://huggingface.co/datasets/allenai/olmo-2-1124-7b-preference-mix-for-rm).
  Tülu 3 is designed for state-of-the-art performance on a diversity of tasks in addition to chat, such as MATH, GSM8K, and IFEval.
  Check out the OLMo 2 paper (forthcoming) or [Tülu 3 paper](https://arxiv.org/abs/2411.15124) for more details!
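For reference, a minimal usage sketch of the reward model described in the updated README, not part of this commit: it assumes OLMo-2-1124-7B-RM loads as a standard `transformers` sequence-classification checkpoint with a single scalar reward head and that the tokenizer ships a chat template. Those assumptions, and the example messages, are illustrative rather than documented behavior of this model.

```python
# Hypothetical sketch: assumes the RM exposes a single-label
# AutoModelForSequenceClassification head (not confirmed by this commit).
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model_name = "allenai/OLMo-2-1124-7B-RM"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(
    model_name, torch_dtype=torch.bfloat16, device_map="auto"
)

# Score one prompt/response pair using the tokenizer's chat template.
messages = [
    {"role": "user", "content": "What is 2 + 2?"},
    {"role": "assistant", "content": "2 + 2 = 4."},
]
input_ids = tokenizer.apply_chat_template(
    messages, tokenize=True, return_tensors="pt"
).to(model.device)

with torch.no_grad():
    reward = model(input_ids).logits[0].item()  # higher = preferred response
print(f"reward: {reward:.3f}")
```

In this kind of setup, the scalar output is only meaningful relatively: candidate completions for the same prompt are ranked by reward, which is how such checkpoints are typically used for preference-based filtering or RLHF-style training.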