metadata
license: mit
language:
- en
base_model:
- Qwen/Qwen2.5-VL-3B-Instruct
- Qwen/Qwen2.5-VL-7B-Instruct
pipeline_tag: visual-question-answering
This repository contains the model presented in GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents.
Project page: https://github.com/ritzz-ai/GUI-R1