2
2 Comments

We just shipped a CLI for browser automation with built-in stealth and captcha solving

We're the team behind BrowserAct. We just released browser-act CLI — a command-line tool that handles stealth browsing, captcha solving, and clean data extraction out of the box.

We built it because we kept hitting the same walls with Playwright and Puppeteer: sites blocking headless browsers, selectors breaking after layout changes, and raw HTML burning too many tokens when piped into LLMs.


Two browser modes

Stealth browser — anti-detection fingerprinting, proxy support, persistent login sessions. For sites that block automation:

browser-act browser create "my-scraper" --proxy socks5://host:port
browser-act browser open <browser_id> https://target-site.com

Real Chrome — connects to your running Chrome via CDP, reuses your existing logins. No setup needed:

browser-act browser real open https://dashboard.example.com

Captcha solving

Cloudflare Turnstile, reCAPTCHA — one command:

browser-act solve-captcha

This is what headless scripts hit on protected sites. browser-act handles it natively.


Clean data extraction

Pages come back as markdown instead of raw HTML. One product page we tested: 854,044 characters of HTML vs 22,815 of markdown — 37x smaller.

browser-act get markdown    # structured text
browser-act get text 5      # text of a specific element
browser-act get value 3     # value of an input field

If you're feeding pages to an LLM, this alone changes your cost math.


How it works in practice

The core loop: open → inspect → interact → verify.

browser-act browser open <id> https://example.com/login
browser-act state
# → [3] input "Email", [4] input "Password", [5] button "Sign In"
browser-act input 3 "[email protected]"
browser-act input 4 "password123"
browser-act click 5
browser-act wait stable
browser-act state    # re-inspect after page change

Everything runs from the terminal. Chain with &&, pipe into scripts, schedule with cron.


Install:

uv tool install browser-act-cli --python 3.12

Source and docs: github.com/browser-act/skills

We'd love feedback — what browser automation tasks eat up your time? What would you want a tool like this to handle?

posted to Icon for group Product Launch
Product Launch
on April 15, 2026
  1. 1

    "Reducing HTML to Markdown by 37x is a complete game-changer for anyone piped into LLMs—the token savings alone make BrowserAct an easy sell. Native stealth and captcha solving out of the box finally makes headless automation feel reliable again.
    Since you've built such a high-leverage tool, you should enter it into this competition--“Prize pool just opened at $0. Your odds are genuinely the best they'll ever be.
    $19 entry. Winner gets a real trip to Tokyo — flights and hotel booked by us.
    Round 01 closes at 100 entries. tokyolore.com

  2. 1

    This looks pretty solid — especially the markdown extraction part, that’s a real practical improvement for LLM workflows.

    One thing that stood out though — the product itself feels like infrastructure-level tooling, but the “browser-act” naming comes across more like a utility or repo rather than something you’d build a company around.

    For tools solving blocking/captcha/automation at this level, perception plays a bigger role once people start relying on it in production.

    Curious if you’re thinking of this as a long-term product/company, or more of a dev tool for now?

Trending on Indie Hackers
Day 4: Why I Built a $199 Workspace Nobody Asked For User Avatar 56 comments Hi IH — quick update. The MVP is live. User Avatar 31 comments Building ExpenseSpy solo, no funding — launching June 17 on iOS & Android User Avatar 25 comments Day 7: 51 people answered my question. I wasn't ready for what they said. User Avatar 17 comments I Built a Football Sentiment Platform in 18 Days. The World Cup Starts in 7 Days. Now I Need Distribution. User Avatar 17 comments Built an n8n booking alert system — is cold outreach dead for B2B micro-tools? User Avatar 16 comments