Model Evaluation
Find more details about how to choose the right model on our Model Evaluation page.
Why LLM Choice Matters
- Accuracy: Better models provide more reliable element detection and action planning
- Speed: Faster models reduce automation latency
- Cost: Different providers offer varying pricing structures
- Reliability: Structured output support ensures consistent automation behavior
Small models on Ollama struggle with consistent structured outputs. While technically supported, we don’t recommend them for production Stagehand workflows.
Environment Variables Setup
Set up your API keys before configuring Stagehand:
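A minimal sketch of loading these keys from a local `.env` file, assuming you use the `dotenv` package (the variable names shown are conventional examples; match them to your provider):

```typescript
// Sketch: load API keys from a local .env file before configuring Stagehand.
// The variable names below are conventional examples, not requirements.
import "dotenv/config";

const apiKey =
  process.env.GOOGLE_API_KEY ??    // Google Gemini models
  process.env.OPENAI_API_KEY ??    // OpenAI models
  process.env.ANTHROPIC_API_KEY;   // Anthropic models

if (!apiKey) {
  throw new Error("Set the API key for your chosen LLM provider before running Stagehand");
}
```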
Supported Providers
Stagehand supports major LLM providers with structured output capabilities:
Production-Ready Providers
| Provider | Best Models | Strengths | Use Case |
| --- | --- | --- | --- |
| OpenAI | `gpt-4.1`, `gpt-4.1-mini` | High accuracy, reliable | Production, complex sites |
| Anthropic | `claude-3-7-sonnet-latest` | Excellent reasoning | Complex automation tasks |
| Google | `gemini-2.5-flash`, `gemini-2.5-pro` | Fast, cost-effective | High-volume automation |
Additional Providers
Basic Configuration
Model Name Format
Stagehand uses the format `provider/model-name` for model specification.
Examples:
- OpenAI: `openai/gpt-4.1`
- Anthropic: `anthropic/claude-3-7-sonnet-latest`
- Google: `google/gemini-2.5-flash` (Recommended)
Quick Start Examples
- Google (Recommended)
- OpenAI
- Anthropic
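For the providers listed above, a quick-start sketch looks roughly like this (shown for the recommended Google model; the option values are illustrative, and the other providers follow the same pattern with their own model string and API key):

```typescript
import { Stagehand } from "@browserbasehq/stagehand";

// Sketch of a Google (recommended) configuration; swap the model string and
// API key to use OpenAI or Anthropic instead.
const stagehand = new Stagehand({
  env: "LOCAL",
  modelName: "google/gemini-2.5-flash", // or "openai/gpt-4.1", "anthropic/claude-3-7-sonnet-latest"
  modelClientOptions: {
    apiKey: process.env.GOOGLE_API_KEY, // OPENAI_API_KEY / ANTHROPIC_API_KEY for the other providers
  },
});

await stagehand.init();
```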
Custom LLM Integration
Custom LLMs are currently only supported in TypeScript.
Vercel AI SDK
The Vercel AI SDK is a popular library for interacting with LLMs. You can use any provider supported by the Vercel AI SDK to create a client for your model, as long as it supports structured outputs. The Vercel AI SDK includes providers for OpenAI, Anthropic, and Google, along with support for Amazon Bedrock and Azure OpenAI.
To get started, install the `ai` package and the provider package you want to use. For example, to use Amazon Bedrock, install the `@ai-sdk/amazon-bedrock` package. You'll also need to import the Vercel AI SDK external client, which is exposed as `AISdkClient`, to create a client for your model.
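Putting this together, a sketch of wiring Stagehand to a Bedrock model through the Vercel AI SDK might look like the following, after installing the `ai` and `@ai-sdk/amazon-bedrock` packages (the `AISdkClient` import path and the Bedrock model ID are assumptions; check the Stagehand examples for the current export):

```typescript
// Assumption: AISdkClient is exported from the main package in your Stagehand version.
import { Stagehand, AISdkClient } from "@browserbasehq/stagehand";
import { bedrock } from "@ai-sdk/amazon-bedrock";

// Wrap a Vercel AI SDK model in the AISdkClient and hand it to Stagehand.
const stagehand = new Stagehand({
  env: "LOCAL",
  llmClient: new AISdkClient({
    // Any AI SDK model with structured output support works here;
    // this Bedrock model ID is illustrative.
    model: bedrock("anthropic.claude-3-5-sonnet-20240620-v1:0"),
  }),
});

await stagehand.init();
```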
Troubleshooting
Common Issues
Model doesn't support structured outputs
Error: `Model does not support structured outputs`
Solution: Use models that support function calling/structured outputs. The minimum requirements are:
- Model must support JSON/structured outputs
- Model must have strong reasoning capabilities
- Model must be able to handle complex instructions
Recommended models:
- OpenAI: GPT-4 series or newer
- Anthropic: Claude 3 series or newer
- Google: Gemini 2 series or newer
- Other providers: Latest models with structured output support
Authentication errors
Error: `Invalid API key` or `Unauthorized`
Solution:
- Verify your environment variables are set correctly
- Check API key permissions and quotas
- Ensure you’re using the correct API key for the provider
- For Anthropic, make sure you have access to the Claude API
Inconsistent automation results
Symptoms: Actions work sometimes but fail other times
Causes & Solutions:
- Weak models: Use more capable models - check our Model Evaluation page for current recommendations
- High temperature: Set temperature to 0 for deterministic outputs
- Complex pages: Switch to models with higher accuracy scores on our Model Evaluation page
- Rate limits: Implement retry logic with exponential backoff (see the sketch after this list)
- Context limits: Reduce page complexity or use models with larger context windows
- Prompt clarity: Ensure your automation instructions are clear and specific
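For the rate-limit case, a generic retry wrapper with exponential backoff can be applied around individual actions; this is a sketch, not a built-in Stagehand feature, and the attempt count and delays are arbitrary:

```typescript
// Sketch: retry a flaky LLM-backed action with exponential backoff.
async function withRetry<T>(fn: () => Promise<T>, maxAttempts = 3): Promise<T> {
  let lastError: unknown;
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    try {
      return await fn();
    } catch (error) {
      lastError = error;
      const delayMs = 1000 * 2 ** attempt; // 1s, 2s, 4s, ...
      await new Promise((resolve) => setTimeout(resolve, delayMs));
    }
  }
  throw lastError;
}

// Usage (assumes an initialized Stagehand page):
// await withRetry(() => page.act("Click the 'Sign in' button"));
```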
Slow performance
Issue: Automation takes too long to respond
Solutions:
- Use fast models: Choose models optimized for speed
  - Any model with < 1s response time
  - Models with “fast” or “flash” variants
- Optimize settings (see the configuration sketch after this list):
  - Use `verbose: 0` to minimize token usage
  - Set temperature to 0 for fastest processing
  - Keep max tokens as low as possible
- Consider local deployment: Local models can provide the lowest latency
- Batch operations: Group multiple actions when possible
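These settings map onto the Stagehand constructor roughly as follows; `verbose` is a real option, while the commented temperature and token-limit fields are assumptions that depend on the LLM client you use:

```typescript
import { Stagehand } from "@browserbasehq/stagehand";

// Sketch of latency-oriented settings.
const stagehand = new Stagehand({
  env: "LOCAL",
  modelName: "google/gemini-2.5-flash", // a "flash"-style model for lower latency
  verbose: 0,                           // minimize logging overhead
  modelClientOptions: {
    apiKey: process.env.GOOGLE_API_KEY,
    // temperature: 0,                  // assumption: only if your client accepts it here
    // maxTokens: 1024,                 // assumption: keep responses short if supported
  },
});
```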
High costs
Issue: LLM usage costs are too high
Cost Optimization Strategies:
- Switch to cost-effective models:
  - Check our Model Evaluation page for current cost-performance benchmarks
  - Choose models with lower cost per token that still meet accuracy requirements
  - Consider models optimized for speed to reduce total runtime costs
- Optimize token usage:
  - Set `verbose: 0` to reduce logging overhead
  - Use concise prompts and limit response length
- Smart model selection: Start with cheaper models and fall back to premium ones only when needed (see the sketch after this list)
- Cache responses: Implement LLM response caching for repeated automation patterns
- Monitor usage: Set up billing alerts and track costs per automation run
- Batch processing: Process multiple similar tasks together
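One way to implement the cheaper-model-first strategy is to catch failures from a low-cost model and retry with a stronger one; this sketch assumes API keys are already available via environment variables, and the model choices are illustrative:

```typescript
import { Stagehand } from "@browserbasehq/stagehand";

// Sketch: try a low-cost model first, then fall back to a stronger model on failure.
async function runWithFallback(task: (stagehand: Stagehand) => Promise<void>) {
  const models = ["google/gemini-2.5-flash", "openai/gpt-4.1"]; // cheap first, premium second
  for (const modelName of models) {
    const stagehand = new Stagehand({ env: "LOCAL", modelName });
    try {
      await stagehand.init();
      await task(stagehand);
      return; // success, no need to escalate to the premium model
    } catch (error) {
      console.warn(`Model ${modelName} failed, trying the next option`, error);
    } finally {
      await stagehand.close();
    }
  }
  throw new Error("All configured models failed");
}
```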