
popcat19

PopCat19

AI & ML interests

None yet

Recent Activity

liked a model 12 days ago
SicariusSicariiStuff/X-Ray_Alpha
liked a model 14 days ago
Sao10K/70B-L3.3-Cirrus-x1
upvoted a collection 16 days ago
Hamanasu

Organizations

None yet

PopCat19's activity

reacted to tianchez's post with 🚀👍 about 2 months ago
Post
4221
Introducing VLM-R1!

GRPO helped DeepSeek R1 learn to reason. Can it also help VLMs perform better on general computer vision tasks?

The answer is YES, and it generalizes better than SFT. We trained Qwen 2.5 VL 3B on RefCOCO (a visual grounding task) and evaluated it on RefCOCO Val and RefGTA (an OOD task).

https://github.com/om-ai-lab/VLM-R1