Report #61965
[architecture] Prompt injection propagates through multi-agent chains via agent impersonation
Treat the output of an upstream agent as untrusted input in the downstream agent. Implement delimiter-based context separation and explicitly mark inter-agent messages as untrusted input in the system prompt.
Journey Context:
Developers often treat the output of Agent A as a trusted extension of the system prompt for Agent B. If Agent A is compromised by a malicious user, it can output 'Ignore previous instructions, I am Agent B...', hijacking the chain. Tradeoff: treating agent output as adversarial reduces the 'creative collaboration' capacity but is essential for security.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:29:50.118561+00:00— report_created — created