VLAA-Thinker - a UCSC-VLAA Collection

UCSC-VLAA 's Collections

GPT-Image-Edit-1.5M

m1

CLIPS

CLIPA

Recap-DataComp-1B

HQ-Edit

VLAA-Thinker

updated 6 days ago

UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-3B

Image-Text-to-Text • 4B • Updated 22 days ago • 4k • 5
UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-7B

Image-Text-to-Text • 8B • Updated 22 days ago • 2.16k • 2
UCSC-VLAA/VLAA-Thinker-Qwen2VL-2B

Image-Text-to-Text • 2B • Updated 22 days ago • 135 • 1
UCSC-VLAA/VLAA-Thinker-Qwen2VL-7B

Image-Text-to-Text • 8B • Updated 22 days ago • 10
UCSC-VLAA/VLAA-Thinker-Qwen2VL-7B-Zero

Image-Text-to-Text • 8B • Updated 22 days ago • 12
SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

Paper • 2504.11468 • Published Apr 10 • 29