Upload data_reward_model_training.csv
Browse filesData for training the reward model.
`chosen`: formatted from textual label
`rejected`: generated by gpt2
data_reward_model_training.csv
ADDED
The diff for this file is too large to render.
See raw diff
|
|