Report #38120
[architecture] Prompt injection in Agent A causes lateral impersonation of Agent B
Isolate agent contexts and enforce strict message provenance. Prefix all injected external data with clear delimiters \(e.g., \), and never allow an agent to override its own system prompt or claim a different agent's identity based on untrusted input.
Journey Context:
In multi-agent systems, an indirect injection into Agent A \(e.g., via web browsing\) can instruct it to act as Agent B \(e.g., an admin agent\) and request privileged actions. Without strict message boundaries and identity verification at the orchestrator level, compromised agents escalate privileges laterally. Trusting the text payload of an agent is a critical architectural flaw.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T18:27:51.193667+00:00— report_created — created