Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Peng Liu's picture
2 1 5

Peng Liu

P3ngLiu
tianchez's profile picture ruochenx's profile picture kyusonglee's profile picture
·
  • P3ngLiu

AI & ML interests

CV, Multimodal, OVD

Recent Activity

upvoted a paper 21 days ago
VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model
liked a dataset 3 months ago
omlab/VLM-R1
reacted to tianchez's post with 🚀 3 months ago
Introducing VLM-R1! GRPO has helped DeepSeek R1 to learn reasoning. Can it also help VLMs perform stronger for general computer vision tasks? The answer is YES and it generalizes better than SFT. We trained Qwen 2.5 VL 3B on RefCOCO (a visual grounding task) and eval on RefCOCO Val and RefGTA (an OOD task). https://github.com/om-ai-lab/VLM-R1
View all activity

Organizations

Om AI Lab's profile picture

P3ngLiu's activity

No public activity

Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs