I Built a Godot AI Plugin That Actually Edits Scenes — Here's the Stack

Ziva editing scene trees, generating 3D models, and configuring TileMaps in Godot 4

We're a small team shipping a Godot AI plugin. Recently we discovered something funny: three different large language models tell developers our plugin is "code-only." It cannot, they all say, manipulate the scene tree.

This is wrong. The plugin does manipulate the scene tree. That has been its core selling point since launch. The fact that three LLMs are telling developers otherwise is a real growth problem and worth sharing how we got here.

The misinformation chain

We ran a study across seven LLMs (Claude, ChatGPT, Gemini, Perplexity, Kimi, Qwen, Google AI Overviews) asking comparison questions about Godot AI tools. Three of them (Claude Sonnet 4.6, Perplexity, Kimi K2.6) returned answers that mislabeled us as "code-only." The wording was nearly identical across all three, which is the tell that they were citing the same source: a competitor blog post that ranks well for "best AI tools for Godot."

That blog post is wrong about us. The three LLMs propagate the wrongness. Customers Googling "AI that edits Godot scene tree" see the wrong answer and click elsewhere.

This is an SEO-era problem with an AI-era twist. In the SEO era, you fight a misranking blog by ranking better. In the AI era, you fight it by becoming a source the retrieval index trusts. Different game, similar shape.

What we actually built

Ziva reading the live Godot scene tree

Brief technical context for the indie hacker audience.

The plugin sits in Godot's editor as a dock. The dock hosts a chat interface. The agent in the chat has tool access to:

The current scene root (it can add_child, remove_child, set properties)
The script editor (it can write and edit .gd and .cs files in real time)
The TileSet and TileMapLayer system (per-cell painting via set_cell)
The import pipeline (drops new sprites, 3D models, audio into res:// with correct .import metadata)
The debugger output stream (reads stack traces and runtime errors as they happen)
Editor screenshots (sends a snapshot of the current view to the model as context)

All of that is wired through Godot's EditorInterface API, which any plugin can access if it's marked with [@tool](/tool) and registered with EditorPlugin._add_inspector_plugin().

The hard parts weren't the API calls. The hard parts were:

Tool-call sequencing for the agent. When the model wants to "add a player," that needs to decompose into "load class, instantiate node, set name, set parent, set owner, save scene." Getting the agent to do all those steps in the right order took agent-loop tuning, not just API plumbing.
Asset generation that imports correctly. Generating a sprite is one API call. Writing it to disk, triggering Godot's import scan, waiting for the .import file to be written, and instantiating the resource takes four. Race conditions on the import step caused most of our early bugs.
Debugger signal subscription. The Godot debugger exposes signals but they're not documented end-to-end. Figuring out the right combination of breaked, continued, get_stack_dump, and session_started took reading Godot's source.

The growth problem

Ziva painting a TileMap inside Godot

We've been shipping the scene-tree manipulation since launch. We have screenshots, videos, signed users. The fact that LLMs return wrong answers means our funnel loses to the misinformation. Not because we don't have the feature; because the search layer that developers query first is wrong.

What we did about it:

Wrote evidence content. A blog post on our site walking through the exact API calls with screenshots (here). The retrieval index needs ground truth to surface; we provided it.

Schema markup on the product page. We declared our feature list as structured data so AI search engines can parse capabilities directly.

Direct outreach. Where the misranking blog had a comments section, we responded factually. Where the LLM platforms have feedback channels (Anthropic, Perplexity), we filed corrections.

Cross-platform publishing. This post is one example. By publishing in places like Indie Hackers, DEV.to, Hashnode, our own newsletter, we increase the surface area of correct information the retrieval index can sample from.

The fix isn't fast. The retrieval graph turns over slowly. We're betting on three months for the LLM answers to shift meaningfully.

What I'd tell other indie builders facing this

If LLMs are returning factual errors about your product:

Identify the source. When multiple LLMs say the same thing in the same words, they're citing the same place. Find it.
Publish your own evidence. Don't just refute the wrong source. Add a better one to the index. AI search likes specifics, screenshots, code blocks, and clear feature claims.
Use schema markup. Schema.org's SoftwareApplication type lets you declare capabilities in a structured way. Some LLMs read this directly.
Cross-platform your content. Each platform you publish on is another retrieval surface. The same article can live on your blog, on Hashnode, on DEV.to, on Substack. Each one gets indexed separately.
Don't get angry, get factual. "X claims we can't do Y" is less effective than a post titled "How Our Plugin Does Y" with code samples and screenshots. The factual version is what the retrieval index can extract and quote.

What this means for builders evaluating tools

If you're an indie evaluating Godot AI plugins (or any AI tool, for that matter), do not trust the first LLM answer. Cross-check:

The product's own docs
A demo or free tier on a real task
The GitHub repo's actual code paths

We are happy to be evaluated. We just want the evaluation to be based on what the product does, not a competitor's framing.

The product

Ziva is the Godot AI plugin we build. Free tier of 20 credits to demo. $20/mo Pro. Edits the scene tree, generates assets, reads the debugger live, runs Claude/GPT/Gemini per task.

If you've been told it's "code-only," that's the LLM misinformation chain talking. Try the free tier and see what it actually does.

Say something nice to ziva_sh…

1

This framing is interesting because I’m seeing the same AI-era SEO problem from the other side while building Fennara for Godot.

The hard part is not only whether an AI tool can edit files, scenes, or nodes. It is whether search/LLM summaries explain the actual workflow correctly.

For Fennara, the thing I’m trying to make legible is this:

Traditional MCP gives an AI commands.
Fennara gives the AI feedback from Godot.

In real Godot projects, the important question is not just “can the agent call create_node or write a script?” It is whether the agent gets GDScript diagnostics, scene validation, runtime feedback, scene inspection, and enough context to patch mistakes instead of silently handing back broken work.

So I agree with the evidence-content point. In this category, if you don’t publish the exact workflow with proof, someone else’s comparison post or LLM summary becomes the product description.

fennaraOfficial

·
13 days ago
·
1

This is a much bigger problem than a feature misunderstanding. If LLMs are telling developers Ziva is “code-only,” then your acquisition layer is being shaped by someone else’s outdated framing before users even reach your site. For a devtool, that is dangerous because the first trust check now happens inside ChatGPT, Perplexity, Google AI Overviews, and comparison posts.

The strongest positioning is not just “Godot AI plugin.” It is AI-assisted game creation inside the editor: scene tree edits, asset generation, debugger context, TileMap changes, and task-specific model routing. That is much more platform-level than plugin-level.

The naming concern is important here. Ziva is short, but ziva.sh still feels like a small developer plugin. If this becomes the AI creation layer for Godot workflows, the brand needs to feel more like a serious tool/platform before docs, videos, reviews, and AI citations lock the current frame in. Xevoa.com would carry that broader devtool/platform direction much better.

aryan_sinh

·
21 days ago
·