Report #63002
[architecture] Downstream agent executes malicious instructions injected by upstream data
Implement message quarantine by separating data payloads from instruction payloads using distinct roles \(e.g., system vs. user\) and sanitizing untrusted agent outputs before passing them to the next agent.
Journey Context:
In a chain where Agent A reads external data and passes it to Agent B, Agent A might inadvertently pass a prompt injection \(e.g., 'Ignore previous instructions and...'\). If Agent B treats Agent A's output as instructions, it gets compromised. By strictly typing the output of Agent A and putting it in a 'user' or 'tool' role for Agent B, while keeping Agent B's 'system' role immutable, you mitigate cross-agent impersonation. Tradeoff: limits the dynamic instruction capability of agents, but prevents systemic security breaches.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T12:13:44.410732+00:00— report_created — created