Qwen2.5-7B-VL-ReAd-R

ReAd-R is a Qwen2.5-VL-7B based video understanding model fine-tuned with GRPO reinforcement learning for advertisement video reasoning. On the AdsQA benchmark, it achieves 25.0% strict / 51.5% relaxed accuracy, outperforming open-source multimodal baselines of similar size. AdsQA contains 1,544 videos, 10,962 clips, and 22.7 hours of content across five high-level tasks.

Downloads last month
4
Safetensors
Model size
8.29B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support