MahmoudMohamed
/

Reward_Model

Text Classification

Generated from Trainer

Model card Files Files and versions

Metrics Training metrics Community

MahmoudMohamed commited on May 8, 2024

Commit

ed26bc7

·

verified ·

1 Parent(s): 9576c36

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -10,6 +10,8 @@ metrics:
 model-index:
 - name: Reward_model
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -45,4 +47,4 @@ The following hyperparameters were used during training:
 - Transformers 4.39.3
 - Pytorch 2.1.2
 - Datasets 2.18.0
-- Tokenizers 0.15.2

 model-index:
 - name: Reward_model
   results: []
+datasets:
+- Anthropic/hh-rlhf
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 - Transformers 4.39.3
 - Pytorch 2.1.2
 - Datasets 2.18.0
+- Tokenizers 0.15.2