CodeGoat24 commited on
Commit
f157db6
·
verified ·
1 Parent(s): 373d46a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -112,9 +112,9 @@ print(text_outputs[0])
112
  ## Citation
113
 
114
  ```
115
- @article{UnifiedReward-Think,
116
- title={Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning.},
117
- author={Wang, Yibin and Li, Zhimin and Zang, Yuhang and Wang, Chunyu and Lu, Qinglin, and Jin, Cheng and Wang, Jiaqi},
118
  journal={arXiv preprint arXiv:2505.03318},
119
  year={2025}
120
  }
 
112
  ## Citation
113
 
114
  ```
115
+ @article{unifiedreward-think,
116
+ title={Unified multimodal chain-of-thought reward model through reinforcement fine-tuning},
117
+ author={Wang, Yibin and Li, Zhimin and Zang, Yuhang and Wang, Chunyu and Lu, Qinglin and Jin, Cheng and Wang, Jiaqi},
118
  journal={arXiv preprint arXiv:2505.03318},
119
  year={2025}
120
  }