Update README.md
#1
by
BustaHeroMax
- opened
README.md
CHANGED
@@ -6,7 +6,7 @@ library_name: transformers
|
|
6 |
pipeline_tag: image-text-to-text
|
7 |
---
|
8 |
|
9 |
-
# GUI-Actor-
|
10 |
|
11 |
This model was introduced in the paper [**GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents**](https://huggingface.co/papers/2506.03143).
|
12 |
It is developed based on [Qwen2.5-VL-3B-Instruct ](https://huggingface.co/Qwen/Qwen2.5-VL-3B-Instruct), augmented by an attention-based action head and finetuned to perform GUI grounding using the dataset [here](https://huggingface.co/datasets/cckevinn/GUI-Actor-Data).
|
|
|
6 |
pipeline_tag: image-text-to-text
|
7 |
---
|
8 |
|
9 |
+
# GUI-Actor-3B with Qwen2.5-VL-3B as backbone VLM
|
10 |
|
11 |
This model was introduced in the paper [**GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents**](https://huggingface.co/papers/2506.03143).
|
12 |
It is developed based on [Qwen2.5-VL-3B-Instruct ](https://huggingface.co/Qwen/Qwen2.5-VL-3B-Instruct), augmented by an attention-based action head and finetuned to perform GUI grounding using the dataset [here](https://huggingface.co/datasets/cckevinn/GUI-Actor-Data).
|