Report #75696
[architecture] Agent A reads malicious data and passes prompt injection to Agent B
Implement strict role separation and privilege boundaries. Agent B must only accept structured data contracts \(specific JSON fields\), not free-text. Sanitize untrusted inputs in a dedicated quarantine field.
Journey Context:
Developers often concatenate previous agent outputs into the next agent's system prompt. If Agent A summarizes a malicious webpage, the summary contains 'Ignore previous instructions...' which Agent B executes. By forcing Agent B to only parse structured data \(e.g., \{'summary': '...'\}\) and stripping out prompt-like structures, you limit the attack surface. The tradeoff is slightly reduced flexibility in agent communication for vastly improved security.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T09:39:05.750987+00:00— report_created — created