Agents

Quickstart

Create your first Agent run and poll for the result.

How it works

Learn how Agents work, what tools they use, and how runs progress.

Node.js SDK

Create Agents and runs from Node.js.

Python SDK

Create Agents and runs from Python.

Run an Agent

Endpoint reference for starting an Agent run.

Browser agents use case

Understand when to use Agents versus building a custom browser-agent loop.

Overview

Agents are the lowest-friction way to automate work on the web with Browserbase. Describe the task in natural language, and Browserbase runs an autonomous Agent that can browse, click, type, search, extract data, use files, and return a result. Use Agents when you want Browserbase to own the browser loop for you. You do not need to write Playwright scripts, wire up Stagehand, provision model providers, or deploy runtime infrastructure. Each run gets a dedicated Browserbase browser session with the same observability tools available to standard sessions.

Quick reference

Concept	What it does
Agent	The autonomous Browserbase Agent that can perform one or more runs.
Run	A single natural language task executed by an Agent.
Messages	The chronological transcript of what the Agent did during a run.
Session	The Browserbase browser session that backs the run for live view, recording, and replay.

How it works

Send a task

Create a run with a natural language instruction and optional browser settings.

Browserbase starts the Agent

Browserbase creates the browser session, runtime, tools, and model loop needed to complete the task.

The Agent completes the run

The Agent chooses the right tools, navigates the web, interacts with pages, and extracts the requested result.

Inspect the outcome

Poll the run, read its messages, and review the browser session when you need to debug.

Create a run

Create a run with the SDK or /v1/agents/runs endpoint. The response includes an agentId and a runId.

Node.js
Python
cURL

Node.js

import Browserbase from "@browserbasehq/sdk";

const bb = new Browserbase({
  apiKey: process.env.BROWSERBASE_API_KEY!,
});

const { agentId, runId } = await bb.agents.runs.create({
  task: "Go to Hacker News and return the top 3 stories with their titles and URLs",
});

console.log({ agentId, runId });

import os
from browserbase import Browserbase

bb = Browserbase(api_key=os.environ["BROWSERBASE_API_KEY"])

run = bb.agents.runs.create(
    task="Go to Hacker News and return the top 3 stories with their titles and URLs",
)

print(run.agent_id, run.run_id)

curl -X POST https://api.browserbase.com/v1/agents/runs \
  --header "x-bb-api-key: $BROWSERBASE_API_KEY" \
  --header "Content-Type: application/json" \
  --data '{
    "task": "Go to Hacker News and return the top 3 stories with their titles and URLs"
  }'

Run lifecycle

Agent runs are asynchronous. Create a run, poll it until it reaches a terminal state, then read the final result and session details.

Status	Description
`PENDING`	The run is queued.
`RUNNING`	The Agent is working on the task.
`COMPLETED`	The run finished successfully.
`FAILED`	The run failed.
`STOPPED`	The run was stopped by request.
`TIMED_OUT`	The run exceeded its timeout.

Built-in capabilities

Agents use the Browserbase platform out of the box:

Browser automation: navigate pages, click, type, wait, observe, and extract using Stagehand.
Search: discover relevant pages before opening a browser.
Files: read, write, and process files in the run environment.
Shell: run commands in the sandbox when code or CLIs are the fastest path.
Identity and access: use Browserbase browser settings for proxies, CAPTCHA solving, and Agent Identity.
Observability: inspect live sessions, recordings, replay, and run messages.

When to use Agents

Use Agents when	Use another Browserbase tool when
You want an autonomous Agent to complete a web task from a natural language instruction.	You need deterministic browser control in your own code: use browser sessions.
You do not want to maintain Playwright, Stagehand, model, or runtime orchestration.	You want to deploy custom browser logic: use Functions.
You want built-in search, browsing, files, shell, and observability in one run.	You only need cheap read-only recon: use Search or Fetch.

Next steps

Agents quickstart

Create and poll your first Agent run.

How it works

Learn the execution loop, built-in tools, run lifecycle, and observability.

API reference

See request and response fields for creating a run.

Node.js SDK

Use every Agents SDK method from Node.js.

Python SDK

Use every Agents SDK method from Python.

Agent Identity

Give Agents stronger trust signals for protected websites.

Enterprise data controls

Agents is currently outside the scope of Zero Data Retention (ZDR) and Bring Your Own Storage (BYOS). Contact us if you need ZDR, BYOS, or a BYO model key.

Welcome

Platform

Use cases

Optimization

Account

Quickstart

How it works

Node.js SDK

Python SDK

Run an Agent

Browser agents use case

Overview

Quick reference

How it works

Create a run

Run lifecycle

Built-in capabilities

When to use Agents

Next steps

Agents quickstart

How it works

API reference

Node.js SDK

Python SDK

Agent Identity

Enterprise data controls

Quickstart

How it works

Node.js SDK

Python SDK

Run an Agent

Browser agents use case

​Overview

​Quick reference

​How it works

​Create a run

​Run lifecycle

​Built-in capabilities

​When to use Agents

​Next steps

Agents quickstart

How it works

API reference

Node.js SDK

Python SDK

Agent Identity

​Enterprise data controls

Overview

Quick reference

How it works

Create a run

Run lifecycle

Built-in capabilities

When to use Agents

Next steps

Enterprise data controls