Agent Browser | Agent Skills

472K installs

•

vercel-labs/agent-browser

•

by Vercel

Score

9.1

/ 10

Installs

472K

Repo Stars

36.8K

Last Updated

1d ago

Fresh

Quality Ratio

99%

Description

Verified

Language

Rust

First Published

Jan 2026

Summary

The Agent Browser agent skill enables AI agents to programmatically navigate, interact with, extract data from, and snapshot web pages using structured commands, effectively providing headless browser automation. This agent skill is ideal for AI developers and engineers building autonomous agents that need to perform complex web interactions, data extraction, or task automation directly on websites. It is a niche skill with a smaller but dedicated user base. Agents can use commands like `agent-browser open <url>`, `click <sel>`, `fill <sel> <text>`, and `get text <sel>` to perform actions and gather information. It facilitates an optimal AI workflow by allowing agents to `snapshot -i --json` to observe the page state and then act using deterministic `@ref` selectors for reliable element interaction. For stateful interactions, it supports session persistence for logged-in states and can connect to an existing Chrome profile for complex authentication flows like OAuth.

Skill Definition

Fast browser automation CLI for AI agents. Chrome/Chromium via CDP with accessibility-tree snapshots and compact @eN element refs.

Install: npm i -g agent-browser && agent-browser install

Start here

This file is a discovery stub, not the usage guide. Before running any agent-browser command, load the actual workflow content from the CLI:

agent-browser skills get core             # start here — workflows, common patterns, troubleshooting
agent-browser skills get core --full      # include full command reference and templates

The CLI serves skill content that always matches the installed version, so instructions never go stale. The content in this stub cannot change between releases, which is why it just points at skills get core.

Specialized skills

Load a specialized skill when the task falls outside browser web pages:

agent-browser skills get electron          # Electron desktop apps (VS Code, Slack, Discord, Figma, ...)
agent-browser skills get slack             # Slack workspace automation
agent-browser skills get dogfood           # Exploratory testing / QA / bug hunts
agent-browser skills get vercel-sandbox    # agent-browser inside Vercel Sandbox microVMs
agent-browser skills get agentcore         # AWS Bedrock AgentCore cloud browsers

Run agent-browser skills list to see everything available on the installed version.

Why agent-browser

Fast native Rust CLI, not a Node.js wrapper
Works with any AI agent (Cursor, Claude Code, Codex, Continue, Windsurf, etc.)
Chrome/Chromium via CDP with no Playwright or Puppeteer dependency
Accessibility-tree snapshots with element refs for reliable interaction
Sessions, authentication vault, state persistence, video recording
Specialized skills for Electron apps, Slack, exploratory testing, cloud providers

Observability Dashboard

The dashboard runs independently of browser sessions on port 4848 and can also be opened through a proxied or forwarded URL such as https://dashboard.agent-browser.localhost. Agents should stay on the dashboard origin: session tabs, status, and stream traffic are proxied internally, so session ports do not need to be exposed.

How to Use

Use in O-mega

Claude Code

npx skills add vercel-labs/agent-browser agent-browser