If your product uses AI, every click increases your costs.
Here's how to turn LLM pricing into plans, credits, and limits that won’t come back to bite you.
If you start from “tokens”, your brain melts. Start from what your user actually does.
Write down the main things in your product that use AI. For example:
Each of these is a single AI action. You’ll want to break every workflow or feature into single actions.
In your product, you will usually have two kinds of models:
Give them simple labels in your system:
These labels are only for you and your team. Users don’t choose them.
From here on, we’ll talk about Standard vs Deep, not specific model names. That way this works no matter which AI provider you use.
Use the same action list from step 1.
Now you want to know: “How much does one of these actions cost me, roughly?”
You can’t get real prices, but you can measure size.
For each action:
Use this:
Examples:
Now you know which actions are light and which ones are heavy.
If your product is live
Now you can get real numbers.
Cost per action ≈ total cost for that action ÷ number of runs
Do this for every action on your list.
At the end of this step you have:
You’ll use this in the next step to get cost per user per month.
You now know the cost of one run of each action.
Now ask: “For a normal paying user, how many times does each action run in a month?”
If you have users: use analytics to see how often each action is used per user.
If you don’t: make a simple and honest estimate for each action.
For each action: Cost per user for this action = (uses per month) × (cost per action)
Add all actions together: AI cost per active user per month.
You will use this number to:
AI is not your only cost.
You also pay for things like:
If you have users:
That gives other cost per user.
If you are pre-launch: Make a simple estimate, and update it later when you have real data.
Then: Total cost per user = AI cost per user + other cost per user
Later, you’ll set your prices so they are well above this total cost.
Remember: As you grow, this number can change (for example, servers can get cheaper per user).
So check it again from time to time.
You can charge in many ways:
Here we’ll assume you use credits, either alone or on top of a subscription.
Use your cost per action from step 3.
Pick simple numbers like 1, 2, 5, 10 and assign them to actions:
More expensive actions should use more credits.
From steps 4 and 5, you know:
Decide how much AI cost you’re happy to include in this plan (for example, “about $5 of AI per user”).
Then:
In the UI, you can show something simple like: “You’ve used 700 / 1,500 AI credits this month.”
You can also save a lot by changing how you call the model.
Here are a few simple tricks that don’t hurt UX:
Use the cheap model most of the time. Use it for normal chat, small edits, and short summaries. Most people won’t see the difference. They only care that it’s fast and clear.
Keep context small. Don’t send the whole chat every time. Make a short summary of old messages and send that instead. For documents, ask the user what they care about, and only send those parts.
Keep answers short. In your prompt, say the AI something like: “Keep the answer under 200 words.” “Give 3 bullet points only.”
Cache old answers. If people ask the same question many times, save the answer. Next time the question is the same, show the saved answer instead of calling the AI again.
These simple steps can cut your AI bill a lot.