1
0 Comments

Vibe Coding Without the $200 Price Tag: Multi-Model Workflow.

How much do you spend on monthly subscriptions?

If you’re already on a premium plan like Claude Max or ChatGPT Pro, stick with it, unless you really want to optimize your "vibe coding" workflow.

If you want to optimize your output while lowering your monthly spend, this is the stack that works best for me:

Designing Functional Specs

  • Model: Gemini 3.5 Flash
  • Process: Use "high thinking" mode (30% AI / 70% Human) via AI Studio, then export to Markdown. While some people have started using HTML for specs, Markdown still works best for me.

Architecture

  • Model: Claude 4.7 Opus (50% AI / 50% Human feedback)
  • Process: This stage is critical. Once the AI understands the spec, you must review the plan. It is vital to plan the folder structure and instruct your coding agent to document it. Define each folder and its purpose; store this in a reference or docs folder and link to it from CLAUDE.md or AGENT.md. Spending more tokens and time here will save you hours later.

Testing

  • Process: Yes, write your tests first. I usually use Opus 4.7 or GPT 5.5, though "max effort" usually isn't necessary. Don’t just say "make the tests." Ask the AI to draft a test plan, then review it. Brainstorm edge cases and iterate a few times. Once the cases are finalized, implement them using GLM (if you have), Flash, or Sonnet with "medium effort."

Implementation

Whether you divide tasks by user stories, build the full stack simultaneously, or focus on the UI first, here is my model breakdown. Always ask to run test after implementation:

  • Backend: Opus 4.7 or GPT 5.5 for complex logic / Sonnet 4.6 for simpler logic
  • Frontend: Gemini 3.5 Flash or GPT 5.5 (you can use stitch MCP if you like)

Refactoring

  • Model: GLM-5.1/Sonnet/Haiku (if it is simple one)

Code Review

  • Model: GLM-5.1 (or GPT-5.5 if you’re rich) (80% AI / 20% Human)
  • Details: As others have noted, GPT models are exceptionally good at code review.

Once finished, update the spec document and create a handoff document. This stage doesn't require a powerful model or high reasoning effort.


Note

The GLM subscription used to be quite affordable, but they have since increased the price. However, you can still try the GLM model via NVIDIA NIM.
People say Kimi K2.6 is really good for the price. I am probably going to try K2.6 or Composer 2.5 in Cursor to compare their pricing and token usage

This has been my personal experience. If you have a different workflow, I’d love to hear it.

on May 23, 2026
Trending on Indie Hackers
AI runs 70% of my distribution. The exact stack. User Avatar 187 comments I'm a solo founder. It took me 9 months and at least 3 stack rewrites to ship my SaaS. User Avatar 148 comments I used $30,983 of AI tokens last month in Claude code on $200/mo plan User Avatar 62 comments I turned someone’s tweet into an app idea and it has made ~$3000 so far in 4 months. User Avatar 30 comments my reddit post got 600K+ views. here's exactly what i did User Avatar 30 comments We could see our AI bill, but not explain it — so I built AiKey User Avatar 25 comments