One Vision-Language-Action Model for GUI Agent
Qinghong (Kevin) Lin PRO
KevinQHLin
AI & ML interests
Vision-Language Model, Video Understanding, Human-AI Interaction
Recent Activity
upvoted
a
collection
4 days ago
VideoMind
liked
a Space
4 days ago
yeliudev/VideoMind-2B
updated
a dataset
5 days ago
KevinQHLin/Videodata