Improve model card for GRPO-CARE
This PR updates the model card by:
- Adding the `pipeline_tag: video-text-to-text` for better discoverability.
- Adding the `library_name: transformers` to indicate the primary library used.
- Including a direct link to the paper within the README.
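
The two metadata additions listed above amount to inserting keys into the YAML front matter of `README.md`. A minimal sketch of that change, assuming a simple line-based front matter block; the helper name `add_front_matter_keys` is hypothetical, and it does plain string handling rather than full YAML parsing:

```python
def add_front_matter_keys(readme: str, extra: dict) -> str:
    """Append `key: value` lines to a README's YAML front matter.

    Hypothetical helper for illustration only: it assumes flat
    `key: value` lines between two `---` delimiters.
    """
    lines = readme.split("\n")
    if not lines or lines[0].strip() != "---":
        raise ValueError("README has no YAML front matter")
    # Find the closing '---' delimiter of the front matter block.
    end = next((i for i in range(1, len(lines)) if lines[i].strip() == "---"), None)
    if end is None:
        raise ValueError("front matter is not closed")
    # Skip keys that are already present so the edit is idempotent.
    existing = {ln.split(":", 1)[0].strip() for ln in lines[1:end] if ":" in ln}
    new_lines = [f"{k}: {v}" for k, v in extra.items() if k not in existing]
    return "\n".join(lines[:end] + new_lines + lines[end:])


old_card = (
    "---\n"
    "license: apache-2.0\n"
    "---\n"
    "\n"
    "Code released at [GRPO-CARE](https://github.com/TencentARC/GRPO-CARE).\n"
)
new_card = add_front_matter_keys(
    old_card,
    {"library_name": "transformers", "pipeline_tag": "video-text-to-text"},
)
print(new_card)
```

Existing keys are left alone, so running the helper twice gives the same card.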
README.md
CHANGED
@@ -1,7 +1,11 @@
 ---
 license: apache-2.0
+library_name: transformers
+pipeline_tag: video-text-to-text
 ---
 
+This repository contains the GRPO-CARE model, presented in the paper [GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning](https://huggingface.co/papers/2506.16141).
+
 Code released at [GRPO-CARE](https://github.com/TencentARC/GRPO-CARE).
 
 ## Citation