6
15 Comments

I built a text-to-video AI in 30 days.

Here’s the stack, the bills, and why my initial launch flopped.
Long-time lurker, first-time poster. 👋

I’ve spent the last month building Textideo, a tool that turns text scripts into ready-to-post videos.

In the spirit of Indie Hackers, I’m not here to pitch you. Instead, I want to share the unpolished reality of building an AI product in 2024—specifically the tech choices, the surprisingly high server costs, and how I completely messed up my first launch.

😫 The Itch: The "Content Treadmill"

I’m a developer, not a video editor. But to market my previous side projects, I needed video content.

I found myself spending 4 hours editing a 30-second clip just to post it on Twitter. The friction was insane. I had folders full of blog posts and notes that were dying on my hard drive because I didn't have the energy to visualize them.

I wanted a tool where I could dump text and get a decent base video out. Since I couldn't find one that fit my budget/workflow, I decided to build it.

🛠 The Stack (Keeping it lean)

I time-boxed the MVP build to 30 days to avoid feature creep.

  • Frontend: Next.js + Tailwind (Hosted on Vercel)
  • Backend: Python/FastAPI (Needed for better library support with media processing)
  • The "Brain": OpenAI GPT-4o API (For script analysis and scene splitting)
  • The Visuals: [Stable Diffusion / Pexels API / Stock Footage API]
  • Video Assembly: FFmpeg (The true hero of this project, running on a separate worker)
  • Database: Supabase
  • Payments: Stripe

The Technical Nightmare:
Handling video rendering in the cloud is painful. My first few renders took 10+ minutes for a 30-second clip because I was processing everything synchronously. I had to rewrite the backend to use a job queue (Redis + Celery/BullMQ), which cut the user's perceived wait time significantly.

💸 The Bills (The "AI Tax")

This is something people don't talk about enough. AI wrappers are expensive to run.

  • OpenAI API costs: ~$0.03 per video script.
  • Image Generation/Stock API: ~$0.10 - $0.20 per video.
  • GPU/Worker Server: ~$40/month (fixed cost).

My margins are thinner than a traditional SaaS. This forced me to rethink my pricing model immediately—I couldn't offer a generous "Unlimited Free Tier" without going bankrupt.

📉 The Launch & The Flop

I launched on [Twitter/Product Hunt] thinking, "This solves a real problem, people will flock to it."

Result:

  • Visitors: 400
  • Signups: 15
  • Paying users: 0

Why? I positioned it as a generic "Text to Video" tool.
It turns out, nobody wakes up looking for a "general video tool." They look for "How to make TikToks without showing my face" or "How to turn blog posts into YouTube Shorts."

🔄 The Pivot

I stopped coding and started DMing. I reached out to content writers and newsletter authors specifically. I offered to manually convert one of their articles into a video using my tool.

This "Do things that don't scale" approach worked.

  • I got direct feedback on the video pacing.
  • I found my first paying customer after the 5th manual demo.

🧠 Lessons for other Indie Hackers

  1. UX > AI Models: Users don't care if you use Llama 3 or GPT-4. They care if the video export fails. Stability is your best feature.
  2. Niche Down Hard: The "AI Video" space is crowded by giants (Runway, Sora, HeyGen). As an indie, you can't compete on quality; you have to compete on workflow specific to a niche.
  3. Watch your margins: If you are building GenAI tools, calculate your unit economics on Day 1.

🤔 I need your help

I’m currently struggling with the pricing model. I'm debating between:

  1. Credit System: Pay-as-you-go (Good for margins, bad for MRR).
  2. Monthly Sub: Predictable revenue, but power users might kill my API budget.

If you’ve built an API-heavy wrapper, how do you handle this?

I’ll be hanging out in the comments all day. Feel free to roast the landing page (link in the first comment) or ask about the FFmpeg implementation!

on November 21, 2025
  1. 1

    good luck :) I just built one , image to video for AI UGC videos

  2. 1

    This is really cool and you have a fascinating tech. Definitely want to learn more not just in your tech, but in how your solution is differentiated from direct AI video generating promts

  3. 1

    The main takeaway is that AI tech is amazing, but cloud costs and positioning are critical. Video rendering, API usage, and GPUs add up fast, so a generic "text-to-video" approach often flops without a clear target audience.

  4. 2

    Here is the link if you want to roast the LP: textideo.com

  5. 1

    This is really cool, I worked on a similar project https://vidbuilder.ai but not using Video models its for Text and Animation based videos. I would like to hear your comments and maybe we can join forces together

  6. 1

    That’s incredible! Building a text-to-video AI in just 30 days shows remarkable skill, focus, and creativity. The potential applications—from content creation to education—are huge. Excited to see how it performs and evolves. Truly inspiring work for anyone interested in AI and multimedia innovation.

  7. 1

    Love the "stopped coding and started DMing" lesson. That's such a hard shift to make as a developer—we want to believe the product will speak for itself.
    On the pricing question: I'd suggest starting with a credit system instead of subscriptions. Here's why—users are hesitant to commit to monthly fees for a new product they haven't tested thoroughly. Credits let them try it with low commitment, and you can always migrate to subscriptions once you have a solid user base and proven retention.
    The transition path could be: credits → credits + subscription bundles → pure subscription tiers. Lets you validate willingness to pay without scaring away early adopters.

  8. 1

    Damn, this is one of the most honest breakdowns I’ve seen about building an AI video tool — especially the part about the “AI tax” nuking your margins. The pivot to doing manual demos was a smart move; most people never talk to their actual users. Curious to see where you take it next, especially how you tackle pricing without letting power-users melt your GPU bill.

  9. 1

    Wow, building a text-to-video AI in just 30 days is incredible! This showcases impressive technical skill, creativity, and dedication. The potential applications—from content creation to education—are huge. Excited to see how it performs and evolves. Truly inspiring work for anyone interested in AI and multimedia innovation.

  10. 1

    This is super impressive 30 days is a very tight build cycle for anything video-related, especially when dealing with rendering + model latencies.

    I’m curious about two things from your journey:
    1. Which part took the most time to get right?
    The model workflow, video pacing, or getting consistent visual quality?
    2. How did you validate that creators actually wanted this format?
    I’m building something for creator workflows myself, and picking the right use case has been harder than the technical side.

    Really admire the speed and the dedication. Would love to hear more about what surprised you the most during the build.

  11. 1

    Loved the level of detail here.
    The job queue rewrite and async rendering were definitely the right move — cloud video assembly is brutal without proper workers. Also agree that stability beats model choice 100% of the time.
    On the pricing side, most AI tools I’ve built ended up using a “base subscription + metered usage” model. It filters out abusers while keeping normal creators happy.
    Thanks for sharing the honest numbers and lessons — super refreshing to read.

    1. 1

      Glad you enjoyed the read! The queue rewrite was painful but absolutely necessary. And thanks for the pricing tip—the "base + metered" model definitely seems like the smartest way to scale sustainably. Appreciate the insight!

  12. 1

    Speed of execution is everything here. Great job on shipping this fast! 🚀
    With giants like Sora and Runway dominating the general space, do you plan to niche down to a specific vertical (e.g., marketing, faceless channels, or education)? I feel like focusing on a specific use-case is the indie superpower right now.

    1. 1

      Appreciate the support! ⚡️ And I totally agree with your take on the "indie superpower."
      We are currently exploring a few verticals, including e.g., faceless channels and ads. Since we can move faster than the giants, we're letting our early users guide us to the most painful problem to solve. Do you have a personal favorite use-case you'd like to see?

Trending on Indie Hackers
You don't need to write the same thing again User Avatar 27 comments I built something that helps founders turn user clicks into real change 🌱✨ User Avatar 26 comments I built an Image-to-3D SaaS using Tencent's Hunyuan 3D AI User Avatar 25 comments The Era of Malleable SaaS: Is This the End of "Opinionated Software"? User Avatar 17 comments Let’s Talk: What’s Missing in Today’s App Builders? User Avatar 17 comments