Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
dhruv3006 
posted an update 16 days ago
Post
1950
GPT 5 for Computer Use agents.

Same tasks, same grounding model we just swapped GPT 4o with GPT 5 as the thinking model.

Left = 4o, right = 5.

Watch GPT 5 pull away.

Reasoning model: OpenAI GPT-5

Grounding model: Salesforce GTA1-7B

Action space: CUA Cloud Instances (macOS/Linux/Windows)


The task is: "Navigate to {random_url} and play the game until you reach a score of 5/5”....each task is set up by having claude generate a random app from a predefined list of prompts (multiple choice trivia, form filling, or color matching)"


Try it yourself here : https://github.com/trycua/cua

Docs : https://docs.trycua.com/docs/agent-sdk/supported-agents/composed-agents

Interesting! Did you have a chance to try k2, glm 4.5 or sonnet 4?

In this post