Update README.md
Browse files
README.md
CHANGED
@@ -57,10 +57,11 @@ with torch.no_grad():
|
|
57 |
|
58 |
## Citation
|
59 |
If you find this model helpful for your research, please cite GRM
|
60 |
-
|
61 |
@article{yang2024regularizing,
|
62 |
title={Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs},
|
63 |
author={Yang, Rui and Ding, Ruomeng and Lin, Yong and Zhang, Huan and Zhang, Tong},
|
64 |
journal={arXiv preprint arXiv:2406.10216},
|
65 |
year={2024}
|
66 |
}
|
|
|
|
57 |
|
58 |
## Citation
|
59 |
If you find this model helpful for your research, please cite GRM
|
60 |
+
```
|
61 |
@article{yang2024regularizing,
|
62 |
title={Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs},
|
63 |
author={Yang, Rui and Ding, Ruomeng and Lin, Yong and Zhang, Huan and Zhang, Tong},
|
64 |
journal={arXiv preprint arXiv:2406.10216},
|
65 |
year={2024}
|
66 |
}
|
67 |
+
```
|