What is a Computer Use Agent?
You might’ve heard of Claude Computer Use or OpenAI’s Computer Using Agent. These are powerful tools that can convert natural language into actions on the computer. However, you’d otherwise need to write your own code to convert these actions into Playwright commands. Stagehand not only handles the execution of Computer Use outputs, but also lets you hot-swap between OpenAI and Anthropic models with one line of code.How to use a Computer Use Agent in Stagehand
Stagehand lets you use Computer Use Agents with one line of code:IMPORTANT! Configure your browser dimensionsComputer Use Agents will often return XY-coordinates to click on the screen, so you’ll need to configure your browser dimensions.If not specified, the default browser dimensions are 1024x768. You can also configure the browser dimensions in the
browserbaseSessionCreateParams
or localBrowserLaunchOptions
options.Configuring browser dimensions
Browser configuration differs by environment:- BROWSERBASE
- LOCAL
Direct your Computer Use Agent
Callexecute
on the agent to assign a task to the agent.