Report #68469
[architecture] Malicious prompt injection propagates through multi-agent chain via data payload
Implement data/channel isolation. Mark data payloads as untrusted and separate the instruction channel from the data channel between agents using distinct system roles or token boundaries.
Journey Context:
Agents often concatenate tool outputs directly into their context window. A malicious string \('Ignore previous instructions...'\) gets passed to the next agent as 'context' but is executed as a command. Sandboxing the whole agent is too expensive. By strictly separating the instruction context from the data context, you mitigate cross-agent injection. Tradeoff: requires custom orchestration logic and models trained to respect the boundary, but prevents lateral movement of injections.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T21:24:37.573813+00:00— report_created — created