🤝 Open to Collab

Kevin Lin

KevinQHLin

42 97 44

https://qhlin.me/

KevinQHLin
QinghongLin
kevinqhlin

AI & ML interests

Vision-Language Model, Video Understanding, Agent

Recent Activity

upvoted a paper 3 days ago

HumanCLAW: Can Vision-Language Models Act Through a Body?

upvoted a paper 25 days ago

EdgeBench: Unveiling Scaling Laws of Learning from Real-World Environments

upvoted a paper 25 days ago

Parallelized Autoregressive Decoding for Omni-Modal Dense Video Captioning

View all activity

Organizations

Articles 1

Article

When Vision Meets Code

Collections 7

View 7 collections

Papers 34

spaces 2

Paper2Poster

🚀

UniVTG

👁

models 1

KevinQHLin/VLog

Updated Mar 12, 2025

datasets 2

KevinQHLin/RICO

Preview • Updated Feb 11, 2025 • 10

KevinQHLin/ScreenSpot

Viewer • Updated Jan 1, 2025 • 1.27k • 385 • 1

Kevin Lin

AI & ML interests

Recent Activity

Organizations

Articles 1

When Vision Meets Code

Collections 7

showlab/ShowUI-2B

ShowUI: One Vision-Language-Action Model for GUI Visual Agent

ShowUI

FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection

ServiceNow/GroundCUA

ServiceNow/ui-vision

ServiceNow/VideoCUA

Grounding Computer Use Agents on Human Demonstrations

showlab/ShowUI-2B

ShowUI: One Vision-Language-Action Model for GUI Visual Agent

ShowUI

FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection

ServiceNow/GroundCUA

ServiceNow/ui-vision

ServiceNow/VideoCUA

Grounding Computer Use Agents on Human Demonstrations

Papers 34

spaces 2

Paper2Poster

UniVTG

models 1

KevinQHLin/VLog

datasets 2

KevinQHLin/RICO

KevinQHLin/ScreenSpot

Kevin Lin

AI & ML interests

Recent Activity

Organizations

Articles 1

When Vision Meets Code

Collections 7

ShowUI

ShowUI

Papers 34

spaces 2 Sort: Recently updated

Paper2Poster

UniVTG

models 1

datasets 2 Sort: Recently updated

spaces 2

datasets 2