The SFT cold start model trained by the Video-R1-COT-165k dataset.

This intermediate checkpoint can be used as the base model for RL training on the Video-R1-260k dataset.

Please refer to: https://github.com/tulerfeng/Video-R1

Downloads last month
770
Safetensors
Model size
8.29B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Video-R1/Qwen2.5-VL-7B-COT-SFT

Base model

Qwen/Qwen2.5-7B
Finetuned
(1815)
this model

Dataset used to train Video-R1/Qwen2.5-VL-7B-COT-SFT