I wired OpenClaw into Slack + Telegram for three “small” jobs:
By lunch it felt perfect. By night I’d spent $250.
Not because OpenClaw is inherently expensive. Because I accidentally created a token snowball.
Summaries were “last 50 messages,” then “include relevant prior decisions,” then “keep continuity.” Each run carried more baggage. I treated the agent like it had memory, but what it really had was an ever-growing prompt.
The kicker: the workflow still felt like it was doing the same thing. The cost curve didn’t.
One workflow attached screenshots “for debugging.” Another pasted full tool outputs into the next step “just in case.” Every attachment and transcript became more tokens, and then those tokens became part of the next call too.
It wasn’t one big request. It was many medium requests slowly turning into huge ones.
I had a big system prompt (format rules, safety rails, tool guidance). I expected “write it once, reuse it forever.”
What I saw instead was behavior consistent with caching not sticking: repeated large prefixes being paid over and over. Even if you’re not hitting an obvious bug, relying on caching as your budget plan is fragile.
My scheduled jobs created a pattern: idle for a while, then fire. If the next run has to re-establish a large prompt footprint, you keep paying the setup cost repeatedly.
That’s how “a few automations” became a token furnace.
What I needed wasn’t another usage chart. I needed control at the point where money actually leaks: model choice per request.
That’s what ClawPane is:
auto, fast, economy, quality (or your own).The thesis: keep OpenClaw workflows the same, but stop paying premium-model tax for routine steps and reduce blast radius when things get noisy.
OpenClaw bills don’t explode because one call is expensive.
They explode because:
If you want predictability, you need (1) hard limits and trimming, and (2) routing that chooses the cheapest model that still does the job.
Prob cheaper just to buy mac minis and use an oss algo