Error bbox locating
#3, opened by wizkd
I use Midscene.js on the web. The action is to click the search box, but it clicks the wrong location. Is there a problem with the coordinate mapping?
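What's more, the result of the test case in https://github.com/bytedance/UI-TARS/blob/main/README_deploy.md is "
Thought: I can see that the system settings interface is already open, but it only shows basic system parameters, such as cache size and memory usage. To set the image's color mode, I first need to find the "Color Management" option. Let me look through the settings list on the left; it should be among those options.
Action: click(start_box='(197,549)')"
which returns a wrong box too.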
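For anyone checking the coordinate mapping, here is a minimal sketch of the conversion I would expect to be needed. It assumes the model's `start_box` values are relative to a fixed normalized range rather than raw screen pixels. The range of 1000, the 1920x1080 screen size, and the helper names `parse_click` and `to_pixels` are all assumptions for illustration and are not confirmed anywhere in this thread.

```python
import re


def parse_click(action: str) -> tuple[int, int]:
    """Extract (x, y) from an action string like click(start_box='(197,549)')."""
    match = re.search(r"click\(start_box='\((\d+),\s*(\d+)\)'\)", action)
    if match is None:
        raise ValueError(f"no click action found in: {action!r}")
    return int(match.group(1)), int(match.group(2))


def to_pixels(
    point: tuple[int, int],
    screen_size: tuple[int, int],
    model_range: int = 1000,  # ASSUMPTION: model coordinates span 0..model_range
) -> tuple[int, int]:
    """Scale a model-space point to pixel coordinates for the given screen size."""
    x, y = point
    width, height = screen_size
    return round(x * width / model_range), round(y * height / model_range)


if __name__ == "__main__":
    point = parse_click("Action: click(start_box='(197,549)')")
    # Hypothetical 1920x1080 screenshot; substitute the real screenshot size.
    print(to_pixels(point, (1920, 1080)))
```

If the click lands in the wrong place when the raw values are used as pixels but in the right place after a scaling step like this, the mismatch is probably in the mapping and not in the model's prediction.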
I seem to have the same problem: I get inaccurate coordinates when using Midscene and UI-TARS-desktop.
Perhaps this can solve the problem here.
I tested their new code, but the issue still seems to be there. In OSWorld, the model also seems to click on the same location multiple times.