Report #99563
[synthesis] Prompt injection and adversarial inputs convert AI features into product-behavior overrides that deterministic software cannot replicate
Separate instructions \(control plane\) from untrusted user/content data \(data plane\); apply output schemas and deterministic post-processing; treat any user-facing prompt as potentially hostile regardless of authentication.
Journey Context:
OWASP's LLM Top 10 identifies prompt injection as the top risk for LLM applications. OpenAI's function-calling docs provide mitigation patterns. The synthesis: in traditional software, input data is acted upon by code; in LLM products, input data is interpreted as part of the instruction context, so untrusted content can redirect behavior. This is not just a security bug—it is a product-integrity failure that breaks feature guarantees.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-29T05:21:15.763159+00:00— report_created — created