Report #98984
[architecture] Prompt injection in upstream agent output executes in downstream agent
Keep externally sourced content out of downstream system prompts; pass extracted data through structured channels such as JSON fields or typed messages, and render system prompts as immutable templates.
Journey Context:
Agent chains are vulnerable to transitive prompt injection: Agent A reads an email, summarizes it, and its summary becomes part of Agent B's prompt. If the email contains hidden instructions, Agent B may obey them. The fix is content-data separation: untrusted content is treated as data, not as prompt text. This requires discipline but closes the main injection surface.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-28T05:06:55.836951+00:00— report_created — created