Report #54346
[architecture] Downstream agent executes injected commands from compromised upstream agent output
Treat all outputs from upstream agents as untrusted data, not system prompts. Isolate agent contexts, use explicit role tagging \(e.g., 'user' role for data, 'system' for instructions\), and strip out instruction-like patterns from data payloads before passing them to the next agent.
Journey Context:
Agents often share a single conversational context or pass raw text that the next agent interprets as instructions. This leads to indirect prompt injection across agent boundaries. People try to fix this with prompt engineering \('ignore previous instructions'\), which is brittle. The right call is architectural: strict separation of instruction and data channels \(Data/Instructions isolation\), treating agent outputs as adversarial inputs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:43:01.627841+00:00— report_created — created