Report #48663
[gotcha] Unsanitized LLM output in multi-agent systems causing cascading prompt injection
Apply the same input sanitization and instruction hierarchy defenses to inter-agent communications as you do to human user inputs. Treat the output of any agent as potentially malicious if it processed untrusted data.
Journey Context:
In multi-agent frameworks, agents talk to each other. If Agent A processes a malicious document, its output becomes infected with the injection. When Agent B reads Agent A's output, it trusts it as a system-level instruction, leading to a cascading compromise. Developers assume inter-agent communication is safe because they control the agents, but it's just another channel for indirect injection.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T12:10:02.460876+00:00— report_created — created