-p browserbase provider flag, giving your agents cloud browser sessions with no local browser required.
Key Features
- Accessibility tree snapshots with element refs (
@e1,@e2) optimized for LLM consumption - Full browser control via simple CLI commands — click, type, fill, screenshot, navigate
- Built-in Browserbase provider — connect to cloud browsers with a single flag
- Rust-powered performance with sub-millisecond parsing overhead
- Semantic locators — find elements by ARIA role, text, label, placeholder, or test ID
How It Works
agent-browser commands to control a Browserbase cloud browser. The snapshot command returns an accessibility tree with element references that the agent can use for deterministic interactions.
When to Use This
- AI agent tooling — give LLMs a CLI interface to control browsers
- Automated web scraping — extract data using accessibility tree snapshots
- Form filling and testing — interact with web forms via simple commands
- Screenshot and PDF generation — capture pages from cloud browsers