After talking to AI engineers, I'm realizing schema validation is only the first layer

I'm building a small reliability layer for LLM outputs and I've been talking to engineers who run AI systems in production.

One thing I've learned this week is that AI reliability failures fall into at least three layers:

Structural failures:
- Invalid JSON
- Missing fields
- Wrong data types

Logical failures:
- Data passes schema validation but doesn't make sense
- Example: age = 900, or end_date before start_date

Agent/runtime failures:
- Hallucinations
- Context saturation
- Poor context engineering in long-running agent workflows
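
To make the first two layers concrete, here's a rough sketch in plain Python (standard library only). Everything in it is made up for illustration: the field names, the 0-130 age range, and the function names `check_structure`, `check_logic`, and `validate` are assumptions, not any particular tool's API. It runs the structural checks first, and only runs the logical checks on output that already passed them.

```python
import json
from datetime import date
from typing import List, Optional, Tuple

# Hypothetical schema for illustration: field name -> expected Python type.
REQUIRED_FIELDS = {"name": str, "age": int, "start_date": str, "end_date": str}


def check_structure(raw: str) -> Tuple[Optional[dict], List[str]]:
    """Layer 1: valid JSON, required fields present, correct data types."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError as exc:
        return None, [f"invalid JSON: {exc.msg}"]
    if not isinstance(data, dict):
        return None, ["top-level value is not an object"]

    errors = []
    for field, expected in REQUIRED_FIELDS.items():
        if field not in data:
            errors.append(f"missing field: {field}")
            continue
        value = data[field]
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, expected):
            errors.append(f"wrong type for {field}")
    return (None if errors else data), errors


def check_logic(data: dict) -> List[str]:
    """Layer 2: the data is well-formed, but does it make sense?"""
    errors = []
    # Assumed business rule: a plausible human age.
    if not 0 <= data["age"] <= 130:
        errors.append("age out of plausible range")
    try:
        start = date.fromisoformat(data["start_date"])
        end = date.fromisoformat(data["end_date"])
    except ValueError:
        errors.append("dates are not ISO formatted (YYYY-MM-DD)")
    else:
        if end < start:
            errors.append("end_date before start_date")
    return errors


def validate(raw: str) -> List[str]:
    """Run layer 1, then layer 2 only if layer 1 passed."""
    data, errors = check_structure(raw)
    if errors:
        return ["structural: " + e for e in errors]
    return ["logical: " + e for e in check_logic(data)]


raw = '{"name": "Ada", "age": 900, "start_date": "2024-05-01", "end_date": "2024-04-01"}'
for problem in validate(raw):
    print(problem)
```

The sample payload is valid JSON with every field present and correctly typed, so a schema-only check would accept it. Both problems it reports come from the second layer. The third layer (hallucinations, context issues) doesn't reduce to a check like this, which is part of why it's harder.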

My MVP currently focuses on the first layer (structural validation), but the conversations I've had suggest that many production issues only show up after outputs have already passed schema checks.

For those building AI products:

What's the most common failure mode you've seen in production?

- Schema issues?
- Business-rule violations?
- Hallucinations?
- Context problems?
- Something else?