Ray2333/GRM-llama3-8B-distill

Ray2333 committed on Jul 5, 2024

Commit 83be0dd · verified · 1 Parent(s): fcd660c

Update README.md

Files changed (1)

README.md +2 -1

@@ -57,10 +57,11 @@ with torch.no_grad():
 ## Citation
 If you find this model helpful for your research, please cite GRM
+```
 @article{yang2024regularizing,
   title={Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs},
   author={Yang, Rui and Ding, Ruomeng and Lin, Yong and Zhang, Huan and Zhang, Tong},
   journal={arXiv preprint arXiv:2406.10216},
   year={2024}
 }
+```