Skip to main content
n8n is a fair-code licensed workflow automation platform that lets you connect hundreds of services through a visual, node-based editor. The Browserbase Agent node brings AI-powered browser automation directly into your n8n workflows, no code required. The integration uses Stagehand under the hood to drive cloud-hosted browsers on Browserbase. You provide a starting URL and a natural language instruction, and the agent navigates, interacts, and extracts data automatically. Results flow into the rest of your n8n workflow like any other node output.

Key Features

No-Code Browser Automation

Provide a URL and an instruction in plain English. The agent handles navigation, clicking, typing, and extraction, all from a visual workflow builder.

AI Agent Modes

Choose between CUA (vision-based), DOM (selector-based), or Hybrid mode depending on your use case and performance needs.

Built-in Browser Features

Advanced stealth mode, residential proxies, automatic captcha solving, ad blocking, session recording, and multi-region support out of the box.

400+ Integration Ecosystem

Chain browser automation results with n8n’s extensive node library. Send extracted data to Slack, store it in Google Sheets, trigger webhooks, and more.

Agent Modes

The node supports three modes that determine how the agent interacts with web pages:
ModeDescriptionBest For
CUAComputer Use Agent, uses vision and screen coordinatesComplex UIs, visual interactions, dynamic content
DOMUses DOM selectors, works with any LLMSpeed, simple pages, structured content
HybridCombines vision and DOM approachesFallback reliability, mixed content types

Model Configuration

The Browserbase Agent node requires two model selections:
  • Driver Model: Powers the browser session (navigation, DOM interactions)
  • Agent Model: Orchestrates the high-level task execution
You provide a single Model API Key in your credentials. Both the Driver Model and Agent Model must be from the same provider. For example, if you select a Gemini driver model, your agent model must also be a Google model, and your Model API Key must be a Google API key.
Supported providers: Google (Gemini), OpenAI (GPT-4o, Computer Use Preview), and Anthropic (Claude).

What You Can Build

  • Data extraction pipelines: Scrape product prices, news headlines, or job listings and pipe them into databases or spreadsheets
  • Form automation: Fill out and submit forms across multiple sites as part of a larger workflow
  • Competitive monitoring: Track competitor pricing, features, or content changes on a schedule
  • Lead enrichment: Visit company websites to gather contact info, tech stacks, or social profiles
  • Automated testing: Validate web flows and capture results alongside your CI/CD pipeline

Get Started