Auto Browser

FreeNot checked

Human-in-the-loop automation framework. Log them in, watch them run.

by LvcidPsyche

GitHub Embed

About

Human-in-the-loop automation framework. Log them in, watch them run.

README

CI PyPI License: MIT MCP Server Local First Glama Open in GitHub Codespaces

Give your AI agent a real browser, with a human in the loop.

Auto Browser is an MCP-native browser control plane for authorized workflows. It gives MCP clients, LLM agents, and operators a shared Playwright browser with human takeover, reusable auth profiles, approvals, audit trails, and local-first deployment.

Works with:

Claude Desktop
Cursor
any MCP client that can talk HTTP or stdio
direct REST callers when you want curl-first control

Why Auto Browser

MCP-native from day one. The browser surface is already packaged as an MCP server instead of bolted on after the fact.
Human takeover when the web gets brittle. noVNC keeps the same live session available when a person needs to step in.
Login once, reuse later. Save named auth profiles and reopen fresh sessions that are already signed in.
Local-first by default. Run the full stack on your own box with Docker Compose, or use Codespaces for a quick hosted demo.
Safety rails built in. Approvals, operator identity, PII scrubbing, Witness receipts, and policy presets are all part of the product surface.
Governed skill induction. Verified browser traces can become staged skill candidates with signed provenance, verifier adapters, and review-only graduation — agents that prove they can repeat themselves correctly, not just act once.

Release Highlights (v1.2.1)

On PyPI. pip install auto-browser-client for the SDK, pip install auto-browser-langchain for the LangChain/LangGraph/CrewAI adapters, and uvx auto-browser-mcp to run the MCP stdio bridge with zero setup. Releases publish via PyPI trusted publishing (OIDC) on tag push.

Since v1.2.0

Verifiable Witness receipts. Receipts were always hash-chained at write time; now you can check the chain on demand. GET /sessions/{id}/witness/verify and the read-only browser.verify_witness MCP tool walk the full chain and report the first divergent receipt if the log was altered, reordered, or truncated.
Sturdier session isolation. Per-session browser containers now get memory/PID/CPU caps, and the controller reaps containers orphaned by a crash at startup.
Fresh dependency floor. Playwright 1.60 (controller and browser-node aligned), uvicorn 0.49, and a unified, current user-agent pool replacing the stale Chrome 122-era defaults.
Quieter failure modes. Cleanup and capture paths that previously swallowed errors now log them with context.
Release gates in CI continue to enforce dependency audits, fixture evals, client tests, Python wheel builds, and the 80% controller coverage gate — now on Python 3.11 and 3.14.

See CHANGELOG.md for the full release history.

Good Fits

internal dashboards and admin tools
operator-assisted QA and browser debugging
login-once, reuse-later account workflows
brittle sites where a human may need to recover the flow
MCP-powered agent workflows that need a real browser, not just HTML fetches

Not the Goal

CAPTCHA solving
unauthorized scraping or account automation
deceptive identity shaping or bypass tooling

What You Get

Browser Control	Operator Safety	Deployment and Integration
Playwright-backed sessions with screenshots, DOM summaries, OCR excerpts, tab controls, downloads, and network inspection	approval gates, operator identity headers, audit events, PII scrubbing, Witness receipts, and protection profiles	MCP over HTTP, bundled stdio bridge, REST API, Docker Compose, Codespaces, auth profiles, and optional per-session isolation

Quickstart

git clone https://github.com/LvcidPsyche/auto-browser.git
cd auto-browser
docker compose up --build

That is enough for local development with the default settings.

Optional:

cp .env.example .env
make doctor

Run make doctor from a normal terminal with local Docker access and permission to open localhost sockets.

Open:

API docs: http://127.0.0.1:8000/docs
Operator dashboard: http://127.0.0.1:8000/dashboard
Visual takeover: http://127.0.0.1:6080/vnc.html?autoconnect=true&resize=scale

All published ports bind to 127.0.0.1 by default.

Try It in Codespaces

Open in GitHub Codespaces

Codespaces provisions the stack automatically. The dashboard and noVNC tabs are usually ready in about 90 seconds.

First Useful Demo

The highest-signal flow in this repo is:

create a session
log in manually if the site needs a human
save the session as a named auth profile
open a new session from that auth profile
continue work without reauthing

Start here:

Minimal session creation:

curl -s http://127.0.0.1:8000/sessions \
  -X POST \
  -H 'content-type: application/json' \
  -d '{"name":"demo","start_url":"https://example.com"}' | jq

Minimal observation:

curl -s http://127.0.0.1:8000/sessions/<session-id>/observe | jq

MCP Clients

Auto Browser exposes:

an HTTP MCP endpoint at http://127.0.0.1:8000/mcp
convenience endpoints at http://127.0.0.1:8000/mcp/tools and http://127.0.0.1:8000/mcp/tools/call
a stdio bridge: uvx auto-browser-mcp from PyPI, or scripts/mcp_stdio_bridge.py in a repo checkout

The default MCP tool profile is curated, which keeps the browser surface compact for better tool selection. If you want the full internal tool surface, set:

MCP_TOOL_PROFILE=full

Raw tool-call example:

curl -s http://127.0.0.1:8000/mcp/tools/call \
  -X POST \
  -H 'content-type: application/json' \
  -d '{
    "name":"browser.create_session",
    "arguments":{
      "name":"demo",
      "start_url":"https://example.com"
    }
  }' | jq

Client setup guides:

Convergence Harness

Auto Browser ships a Stage 0 convergence harness for Agent Skill Induction. It runs a structured task contract, records tamper-checked traces, verifies completion, and writes a staged skill candidate with signed provenance. Generated skills are staged only — promotion stays explicit and reviewed.

Read-only inspection tools (harness.list_runs, harness.get_status, harness.get_trace) are exposed in the default curated MCP tool profile so agents can introspect harness state without elevated access. Convergence runs, drift checks, candidate management, and graduation require MCP_TOOL_PROFILE=full, or can be invoked directly over REST.

Start with docs/convergence-harness.md. A deterministic local smoke is:

python -m controller.harness.run --contract evals/contracts/example_read.json --mock-final-url https://example.com --mock-final-text "Example Domain"

For MCP clients, set MCP_TOOL_PROFILE=full to expose the harness.* tools.

Security and Compliance

For a real private deployment, set at least:

APP_ENV=production
API_BEARER_TOKEN=<strong-random-secret>
REQUIRE_OPERATOR_ID=true
AUTH_STATE_ENCRYPTION_KEY=<44-char-fernet-key>
REQUIRE_AUTH_STATE_ENCRYPTION=true
REQUEST_RATE_LIMIT_ENABLED=true
METRICS_ENABLED=true
STEALTH_ENABLED=false

COMPLIANCE_TEMPLATE can apply a preconfigured posture at startup:

Preset	Auth Encryption	Operator ID	PII Scrub	Isolation	Max Session Age
`strict`	required	required	all layers	`docker_ephemeral`	4h
`balanced`	-	required	network + text	shared	24h

Both presets require upload approvals and enable Witness receipts. Startup writes the applied policy to /data/compliance-manifest.json. The legacy names (HIPAA, SOC2, GDPR, PCI-DSS) still work as deprecated aliases and emit a warning at startup.

Example:

COMPLIANCE_TEMPLATE=strict docker compose up

For deployment details, hosted Witness notes, CLI auth modes, and reverse-SSH guidance, see:

Architecture at a Glance

flowchart LR
    User[Human operator] -->|watch / takeover| noVNC[noVNC]
    LLM[OpenAI / Claude / Gemini] -->|shared tools| Controller[Controller API]
    Controller -->|Playwright protocol| Browser[Browser node]
    noVNC --> Browser
    Browser --> Artifacts[(screenshots / traces / auth state)]
    Controller --> Artifacts
    Controller --> Policy[Allowlist + approval gates]

Core components:

browser-node/ runs Chromium, Xvfb, x11vnc, and noVNC
controller/ exposes the FastAPI controller, MCP transport, policy rails, and orchestration endpoints
data/ holds runtime artifacts, auth state, approvals, audit logs, and optional CLI caches
scripts/ contains local helpers for doctor, smoke tests, bridges, and release checks

Repo Guide

Path	What It Contains
controller/	controller API, MCP transport, tests, and packaging
browser-node/	browser runtime and Playwright connection layer
examples/	copy-paste flows and MCP client setup
integrations/langchain/	LangChain, LangGraph, and CrewAI adapters
docs/	architecture, deployment, hardening, and launch docs
scripts/	doctor, smoke harnesses, stdio bridge, and auth helpers
ops/	supporting service templates and operational assets

Common Commands

Command	Purpose
`make help`	list available repo commands
`make lint`	run Ruff checks on app, tests, and helper scripts
`make test`	run controller tests in Docker
`make test-local`	run controller tests on host Python 3.10+
`make eval`	run deterministic provider/profile eval scoring
`make doctor`	run the local readiness smoke
`make release-audit`	run the fuller release-validation pass
`make smoke-isolation`	verify per-session Docker isolation
`make smoke-reverse-ssh`	verify reverse-SSH remote access

Documentation Map

If You Want To...	Start Here
understand the system shape	docs/architecture.md
connect Claude Desktop or Cursor	docs/mcp-clients.md
run the curl-first examples	examples/README.md
deploy on a trusted host	docs/deployment.md
review production constraints	docs/production-hardening.md
run the convergence harness	docs/convergence-harness.md
inspect release history	CHANGELOG.md
see where the project is headed	ROADMAP.md

Contributing

If you want to help, start with:

If Auto Browser is useful, a star helps other people find it. Sponsorship and tip options live in TIPS.md.

from github.com/LvcidPsyche/auto-browser

Install Auto Browser in Claude Desktop, Claude Code & Cursor

Run in your terminal:

claude mcp add auto-browser -- npx

FAQ

Is Auto Browser MCP free?

Yes, Auto Browser MCP is free — one-click install via Unyly at no cost.

Does Auto Browser need an API key?

No, Auto Browser runs without API keys or environment variables.

Is Auto Browser hosted or self-hosted?

Self-hosted: the server runs locally on your machine via the install command above.

How do I install Auto Browser in Claude Desktop, Claude Code or Cursor?

Open Auto Browser on unyly.org, pick your client tab (Claude Desktop, Claude Code, Cursor) and press Install — the config is generated automatically, no JSON editing.

Related MCPs

Playwright

Browser automation, scraping, screenshots

by Microsoft

4.915.2K

Puppeteer

Browser automation and web scraping.

by modelcontextprotocol

opentabs-dev/opentabs

Plugin-based MCP server + Chrome extension that gives AI agents access to web applications through the user's authenticated browser session. 100+ plugins with a

by opentabs-dev

robhunter/agentdeals

1,500+ developer infrastructure deals, free tiers, and startup programs across 54 categories. Search deals, compare vendors, plan stacks, and track pricing chan

by robhunter

Compare Auto Browser with

Auto BrowservsPlaywright Auto BrowservsPuppeteer Auto Browservsopentabs-dev/opentabs Auto Browservsrobhunter/agentdeals

Not sure what to pick?

Find your stack in 60 seconds

Author?

Embed badge for your README

Browse similar

All browse MCPs

loading…

Browse all

Auto Browser

FreeNot checked

Human-in-the-loop automation framework. Log them in, watch them run.

by LvcidPsyche

GitHub Embed

About

Human-in-the-loop automation framework. Log them in, watch them run.

README

CI PyPI License: MIT MCP Server Local First Glama Open in GitHub Codespaces

Give your AI agent a real browser, with a human in the loop.

Works with:

Claude Desktop
Cursor
any MCP client that can talk HTTP or stdio
direct REST callers when you want curl-first control

Why Auto Browser

MCP-native from day one. The browser surface is already packaged as an MCP server instead of bolted on after the fact.
Human takeover when the web gets brittle. noVNC keeps the same live session available when a person needs to step in.
Login once, reuse later. Save named auth profiles and reopen fresh sessions that are already signed in.
Local-first by default. Run the full stack on your own box with Docker Compose, or use Codespaces for a quick hosted demo.
Safety rails built in. Approvals, operator identity, PII scrubbing, Witness receipts, and policy presets are all part of the product surface.
Governed skill induction. Verified browser traces can become staged skill candidates with signed provenance, verifier adapters, and review-only graduation — agents that prove they can repeat themselves correctly, not just act once.

Release Highlights (v1.2.1)

On PyPI. pip install auto-browser-client for the SDK, pip install auto-browser-langchain for the LangChain/LangGraph/CrewAI adapters, and uvx auto-browser-mcp to run the MCP stdio bridge with zero setup. Releases publish via PyPI trusted publishing (OIDC) on tag push.

Since v1.2.0

Verifiable Witness receipts. Receipts were always hash-chained at write time; now you can check the chain on demand. GET /sessions/{id}/witness/verify and the read-only browser.verify_witness MCP tool walk the full chain and report the first divergent receipt if the log was altered, reordered, or truncated.
Sturdier session isolation. Per-session browser containers now get memory/PID/CPU caps, and the controller reaps containers orphaned by a crash at startup.
Fresh dependency floor. Playwright 1.60 (controller and browser-node aligned), uvicorn 0.49, and a unified, current user-agent pool replacing the stale Chrome 122-era defaults.
Quieter failure modes. Cleanup and capture paths that previously swallowed errors now log them with context.
Release gates in CI continue to enforce dependency audits, fixture evals, client tests, Python wheel builds, and the 80% controller coverage gate — now on Python 3.11 and 3.14.

See CHANGELOG.md for the full release history.

Good Fits

internal dashboards and admin tools
operator-assisted QA and browser debugging
login-once, reuse-later account workflows
brittle sites where a human may need to recover the flow
MCP-powered agent workflows that need a real browser, not just HTML fetches

Not the Goal

CAPTCHA solving
unauthorized scraping or account automation
deceptive identity shaping or bypass tooling

What You Get

Browser Control	Operator Safety	Deployment and Integration
Playwright-backed sessions with screenshots, DOM summaries, OCR excerpts, tab controls, downloads, and network inspection	approval gates, operator identity headers, audit events, PII scrubbing, Witness receipts, and protection profiles	MCP over HTTP, bundled stdio bridge, REST API, Docker Compose, Codespaces, auth profiles, and optional per-session isolation

Quickstart

git clone https://github.com/LvcidPsyche/auto-browser.git
cd auto-browser
docker compose up --build

That is enough for local development with the default settings.

Optional:

cp .env.example .env
make doctor

Run make doctor from a normal terminal with local Docker access and permission to open localhost sockets.

Open:

API docs: http://127.0.0.1:8000/docs
Operator dashboard: http://127.0.0.1:8000/dashboard
Visual takeover: http://127.0.0.1:6080/vnc.html?autoconnect=true&resize=scale

All published ports bind to 127.0.0.1 by default.

Try It in Codespaces

Open in GitHub Codespaces

Codespaces provisions the stack automatically. The dashboard and noVNC tabs are usually ready in about 90 seconds.

First Useful Demo

The highest-signal flow in this repo is:

create a session
log in manually if the site needs a human
save the session as a named auth profile
open a new session from that auth profile
continue work without reauthing

Start here:

Minimal session creation:

curl -s http://127.0.0.1:8000/sessions \
  -X POST \
  -H 'content-type: application/json' \
  -d '{"name":"demo","start_url":"https://example.com"}' | jq

Minimal observation:

curl -s http://127.0.0.1:8000/sessions/<session-id>/observe | jq

MCP Clients

Auto Browser exposes:

an HTTP MCP endpoint at http://127.0.0.1:8000/mcp
convenience endpoints at http://127.0.0.1:8000/mcp/tools and http://127.0.0.1:8000/mcp/tools/call
a stdio bridge: uvx auto-browser-mcp from PyPI, or scripts/mcp_stdio_bridge.py in a repo checkout

The default MCP tool profile is curated, which keeps the browser surface compact for better tool selection. If you want the full internal tool surface, set:

MCP_TOOL_PROFILE=full

Raw tool-call example:

curl -s http://127.0.0.1:8000/mcp/tools/call \
  -X POST \
  -H 'content-type: application/json' \
  -d '{
    "name":"browser.create_session",
    "arguments":{
      "name":"demo",
      "start_url":"https://example.com"
    }
  }' | jq

Client setup guides:

Convergence Harness

Start with docs/convergence-harness.md. A deterministic local smoke is:

python -m controller.harness.run --contract evals/contracts/example_read.json --mock-final-url https://example.com --mock-final-text "Example Domain"

For MCP clients, set MCP_TOOL_PROFILE=full to expose the harness.* tools.

Security and Compliance

For a real private deployment, set at least:

APP_ENV=production
API_BEARER_TOKEN=<strong-random-secret>
REQUIRE_OPERATOR_ID=true
AUTH_STATE_ENCRYPTION_KEY=<44-char-fernet-key>
REQUIRE_AUTH_STATE_ENCRYPTION=true
REQUEST_RATE_LIMIT_ENABLED=true
METRICS_ENABLED=true
STEALTH_ENABLED=false

COMPLIANCE_TEMPLATE can apply a preconfigured posture at startup:

Preset	Auth Encryption	Operator ID	PII Scrub	Isolation	Max Session Age
`strict`	required	required	all layers	`docker_ephemeral`	4h
`balanced`	-	required	network + text	shared	24h

Example:

COMPLIANCE_TEMPLATE=strict docker compose up

For deployment details, hosted Witness notes, CLI auth modes, and reverse-SSH guidance, see:

Architecture at a Glance

flowchart LR
    User[Human operator] -->|watch / takeover| noVNC[noVNC]
    LLM[OpenAI / Claude / Gemini] -->|shared tools| Controller[Controller API]
    Controller -->|Playwright protocol| Browser[Browser node]
    noVNC --> Browser
    Browser --> Artifacts[(screenshots / traces / auth state)]
    Controller --> Artifacts
    Controller --> Policy[Allowlist + approval gates]

Core components:

browser-node/ runs Chromium, Xvfb, x11vnc, and noVNC
controller/ exposes the FastAPI controller, MCP transport, policy rails, and orchestration endpoints
data/ holds runtime artifacts, auth state, approvals, audit logs, and optional CLI caches
scripts/ contains local helpers for doctor, smoke tests, bridges, and release checks

Repo Guide

Path	What It Contains
controller/	controller API, MCP transport, tests, and packaging
browser-node/	browser runtime and Playwright connection layer
examples/	copy-paste flows and MCP client setup
integrations/langchain/	LangChain, LangGraph, and CrewAI adapters
docs/	architecture, deployment, hardening, and launch docs
scripts/	doctor, smoke harnesses, stdio bridge, and auth helpers
ops/	supporting service templates and operational assets

Common Commands

Command	Purpose
`make help`	list available repo commands
`make lint`	run Ruff checks on app, tests, and helper scripts
`make test`	run controller tests in Docker
`make test-local`	run controller tests on host Python 3.10+
`make eval`	run deterministic provider/profile eval scoring
`make doctor`	run the local readiness smoke
`make release-audit`	run the fuller release-validation pass
`make smoke-isolation`	verify per-session Docker isolation
`make smoke-reverse-ssh`	verify reverse-SSH remote access

Documentation Map

If You Want To...	Start Here
understand the system shape	docs/architecture.md
connect Claude Desktop or Cursor	docs/mcp-clients.md
run the curl-first examples	examples/README.md
deploy on a trusted host	docs/deployment.md
review production constraints	docs/production-hardening.md
run the convergence harness	docs/convergence-harness.md
inspect release history	CHANGELOG.md
see where the project is headed	ROADMAP.md

Contributing

If you want to help, start with:

If Auto Browser is useful, a star helps other people find it. Sponsorship and tip options live in TIPS.md.

from github.com/LvcidPsyche/auto-browser

Install Auto Browser in Claude Desktop, Claude Code & Cursor

Run in your terminal:

claude mcp add auto-browser -- npx

FAQ

Is Auto Browser MCP free?

Yes, Auto Browser MCP is free — one-click install via Unyly at no cost.

Does Auto Browser need an API key?

No, Auto Browser runs without API keys or environment variables.

Is Auto Browser hosted or self-hosted?

Self-hosted: the server runs locally on your machine via the install command above.

How do I install Auto Browser in Claude Desktop, Claude Code or Cursor?

Open Auto Browser on unyly.org, pick your client tab (Claude Desktop, Claude Code, Cursor) and press Install — the config is generated automatically, no JSON editing.

Related MCPs

Playwright

Browser automation, scraping, screenshots

by Microsoft

4.915.2K

Puppeteer

Browser automation and web scraping.

by modelcontextprotocol

opentabs-dev/opentabs

Plugin-based MCP server + Chrome extension that gives AI agents access to web applications through the user's authenticated browser session. 100+ plugins with a

by opentabs-dev

robhunter/agentdeals

1,500+ developer infrastructure deals, free tiers, and startup programs across 54 categories. Search deals, compare vendors, plan stacks, and track pricing chan

by robhunter

Compare Auto Browser with

Auto BrowservsPlaywright Auto BrowservsPuppeteer Auto Browservsopentabs-dev/opentabs Auto Browservsrobhunter/agentdeals

Not sure what to pick?

Find your stack in 60 seconds

Author?

Embed badge for your README

Browse similar

All browse MCPs

Command Palette

Auto Browser

About

README

Why Auto Browser

Release Highlights (v1.2.1)

Since v1.2.0

Good Fits

Not the Goal

What You Get

Quickstart

Try It in Codespaces

First Useful Demo

MCP Clients

Convergence Harness

Security and Compliance

Architecture at a Glance

Repo Guide

Common Commands

Documentation Map

Contributing

Install Auto Browser in Claude Desktop, Claude Code & Cursor

FAQ

Is Auto Browser MCP free?

Does Auto Browser need an API key?

Is Auto Browser hosted or self-hosted?

How do I install Auto Browser in Claude Desktop, Claude Code or Cursor?

Related MCPs

Playwright

Puppeteer

opentabs-dev/opentabs

robhunter/agentdeals

Compare Auto Browser with

Auto Browser

About

README

Why Auto Browser

Release Highlights (v1.2.1)

Since v1.2.0

Good Fits

Not the Goal

What You Get

Quickstart

Try It in Codespaces

First Useful Demo

MCP Clients

Convergence Harness

Security and Compliance

Architecture at a Glance

Repo Guide

Common Commands

Documentation Map

Contributing

Install Auto Browser in Claude Desktop, Claude Code & Cursor

FAQ

Is Auto Browser MCP free?

Does Auto Browser need an API key?

Is Auto Browser hosted or self-hosted?

How do I install Auto Browser in Claude Desktop, Claude Code or Cursor?

Related MCPs

Playwright

Puppeteer

opentabs-dev/opentabs

robhunter/agentdeals

Compare Auto Browser with