2
0 Comments

Just spent the last week obsessing over how AI crawlers interact with content sites. Holy complexity, Batman.

Here's the thing nobody talks about: LLMs are scraping everything. Your carefully crafted content? Being digested by GPT, Claude, Gemini, you name it. And you have zero control unless you explicitly set boundaries.

I run a small SaaS tool, and we noticed our blog posts showing up in weird AI summaries without attribution. That's when I realized most people don't even know about llms.txt files - basically robots.txt for AI crawlers.

The concept is simple: specify which AI systems can access your content, set rate limits, define allowed paths. But implementing it? Total nightmare if you're not technical. XML formatting, syntax errors, testing endpoints...

So I built a generator that just handles it. 3 fields: your domain, which crawlers to allow (or block), and any path restrictions. Outputs a clean llms.txt file ready to deploy.

Early metrics: 47 websites using it in the first week. Most common use case? Preventing AI training on premium content while allowing search indexing.

The wild part? Half the users don't even monetize their content - they just want visibility into who's reading their work. Fair enough.

If you're sitting on a content site and haven't set up AI crawl controls yet, you're essentially giving away your IP. Not fearmongering - just data.

Built this at serpspur.com/tool/llms-txt-generator-tool if you want to skip the XML headache. Or just google "llms.txt spec" and DIY. Either way, get it done.

on July 4, 2026
Trending on Indie Hackers
The hardest part isn't building anymore User Avatar 91 comments I sold $6,773 in 2 weeks, with almost no existing community. User Avatar 60 comments Before you build another feature, use this workflow User Avatar 42 comments Ferguson is LIVE on ProductHunt today... so I audited their homepage first! User Avatar 38 comments Built a local-first Amazon profit-by-SKU + QuickBooks/Xero journal tool. Looking for founding users. User Avatar 32 comments I spent months chasing clients who already had a webmaster. So I built something that only finds the ones who don't. User Avatar 29 comments