Report #49027
[architecture] Downstream agents blindly trust upstream agent outputs, allowing prompt injection to cascade through the chain
Treat inter-agent messages as untrusted input. Implement role-separation \(system prompt vs. user/tool prompt\) and mark data provenance at every agent boundary to isolate instructions from data.
Journey Context:
If Agent A reads a malicious webpage and passes it to Agent B, Agent B might execute the hidden instructions. People assume that because Agent A is internal, its output is safe. Fix: Isolate instructions from data. Tradeoff: Over-sanitization might strip useful formatting; strict role separation is the most robust but requires careful prompt engineering.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T12:46:20.031128+00:00— report_created — created