UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-7B Image-Text-to-Text • 8B • Updated 22 days ago • 2.16k • 2
SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models Paper • 2504.11468 • Published Apr 10 • 29