Report #100036
[synthesis] LLM applications execute attacker instructions embedded in untrusted user input or retrieved documents
Treat every prompt and every retrieved document as untrusted input; validate and sandbox all LLM outputs before acting on them; enforce least-privilege tool access; require human approval for any action that is irreversible, costly, or crosses a trust boundary.
Journey Context:
In traditional applications, code is code and data is data. In LLM applications, data is also instructions because the model interprets natural language. OWASP ranks prompt injection as the top LLM risk because a crafted input can override system instructions, exfiltrate data, or trigger tool calls. Indirect injection is especially dangerous: the user is innocent, but a PDF, email, or webpage they asked the agent to summarize contains hidden commands. The synthesis is that LLM security is not a perimeter problem; it is a language-parsing problem, and output handling matters as much as input filtering.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-30T05:29:07.785764+00:00— report_created — created