Skip to main content
Agent Browser is a headless browser automation CLI built for AI agents. It provides a fast Rust-powered command-line interface with accessibility-first semantics, making it ideal for LLM-driven browser control. Agent Browser has built-in Browserbase support via its -p browserbase provider flag, giving your agents cloud browser sessions with no local browser required.

Key Features

  • Accessibility tree snapshots with element refs (@e1, @e2) optimized for LLM consumption
  • Full browser control via simple CLI commands — click, type, fill, screenshot, navigate
  • Built-in Browserbase provider — connect to cloud browsers with a single flag
  • Rust-powered performance with sub-millisecond parsing overhead
  • Semantic locators — find elements by ARIA role, text, label, placeholder, or test ID

How It Works

AI Agent → agent-browser CLI → Browserbase Cloud Browser → Target Website → Data Back
Your AI agent calls agent-browser commands to control a Browserbase cloud browser. The snapshot command returns an accessibility tree with element references that the agent can use for deterministic interactions.

When to Use This

  • AI agent tooling — give LLMs a CLI interface to control browsers
  • Automated web scraping — extract data using accessibility tree snapshots
  • Form filling and testing — interact with web forms via simple commands
  • Screenshot and PDF generation — capture pages from cloud browsers

Get Started