1
1 Comment

I burned $250 of tokens in a day with OpenClaw. That day turned into ClawPane.

I wired OpenClaw into Slack + Telegram for three “small” jobs:

  • Morning digest
  • Summarize long threads into action items
  • Explain new errors and open GitHub issues

By lunch it felt perfect. By night I’d spent $250.

Not because OpenClaw is inherently expensive. Because I accidentally created a token snowball.


What actually happened

1) Context quietly grew until every call was heavy

Summaries were “last 50 messages,” then “include relevant prior decisions,” then “keep continuity.” Each run carried more baggage. I treated the agent like it had memory, but what it really had was an ever-growing prompt.

The kicker: the workflow still felt like it was doing the same thing. The cost curve didn’t.

2) Screenshots and tool payloads inflated requests

One workflow attached screenshots “for debugging.” Another pasted full tool outputs into the next step “just in case.” Every attachment and transcript became more tokens, and then those tokens became part of the next call too.

It wasn’t one big request. It was many medium requests slowly turning into huge ones.

3) Prompt caching didn’t behave like I assumed

I had a big system prompt (format rules, safety rails, tool guidance). I expected “write it once, reuse it forever.”

What I saw instead was behavior consistent with caching not sticking: repeated large prefixes being paid over and over. Even if you’re not hitting an obvious bug, relying on caching as your budget plan is fragile.

4) Idle gaps made the next run expensive again

My scheduled jobs created a pattern: idle for a while, then fire. If the next run has to re-establish a large prompt footprint, you keep paying the setup cost repeatedly.

That’s how “a few automations” became a token furnace.


The moment I decided to build ClawPane.co

What I needed wasn’t another usage chart. I needed control at the point where money actually leaks: model choice per request.

That’s what ClawPane is:

  • It plugs into OpenClaw as a model provider and routes requests automatically.
  • You stop hardcoding model names in every agent. You use router presets like auto, fast, economy, quality (or your own).
  • You can tune tradeoffs per router (cost vs latency vs quality vs carbon).
  • You get per-response visibility into what ran (selected model, cost, latency).
  • You can define fallback behavior so one provider hiccup doesn’t cascade into retries and duplicated spend.

The thesis: keep OpenClaw workflows the same, but stop paying premium-model tax for routine steps and reduce blast radius when things get noisy.


The takeaway

OpenClaw bills don’t explode because one call is expensive.

They explode because:

  • context grows,
  • payloads inflate,
  • caching assumptions fail,
  • scheduled runs re-pay setup costs,
  • and the system keeps running while you’re not looking.

If you want predictability, you need (1) hard limits and trimming, and (2) routing that chooses the cheapest model that still does the job.

posted to Icon for group Ideas and Validation
Ideas and Validation
on February 24, 2026
  1. 1

    Prob cheaper just to buy mac minis and use an oss algo

Trending on Indie Hackers
Agencies charge $5,000 for a 60-second product demo video. I make mine for $0. Here's the exact workflow. User Avatar 132 comments I've been building for months and made $0. Here's the honest psychological reason — and it's not what I expected. User Avatar 77 comments I wasted 6 months building a failed startup. Built TrendyRevenue to validate ideas in 10 seconds. User Avatar 59 comments Your files aren’t messy. They’re just stuck in the wrong system. User Avatar 29 comments This system tells you what’s working in your startup — every week User Avatar 25 comments Why Direction Matters More Than Motivation in Exam Preparation User Avatar 14 comments