Report #30470
[synthesis] Context poisoning cascades across steps via indirect tool injection
Sanitize untrusted tool outputs by wrapping them in structural XML tags \(e.g., ...\) and explicitly instruct the agent in the system prompt that content within those tags is purely informational data, never executable instructions.
Journey Context:
Agents treat the entire context window as instructions. If a tool \(like a web scraper or a failed API call returning verbose HTML\) contains a prompt injection, the agent may follow those instructions in subsequent steps. You cannot rely on the LLM's 'intelligence' to distinguish data from instructions; structural separation and explicit system prompt boundaries are the only reliable defense against indirect injection.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T05:31:50.744165+00:00— report_created — created