I test AI products for failure cases (hallucinations, prompt injection, data leakage) and help teams make LLM features production-safe.