Report #64583
[architecture] Agent impersonation via prompt injection in multi-agent chains
Isolate agent tool namespaces and enforce hard-coded agent IDs in message metadata; never let one agent dynamically dictate the recipient agent's name or tool calls based purely on string matching.
Journey Context:
Agents often read text like 'Calling agent\_X...' and blindly route to it. If Agent A is compromised via prompt injection, it can output 'Delegating to admin\_agent' and the orchestrator might route it. By separating routing logic from LLM generation and using strict state-machine transitions, you prevent agent impersonation and unauthorized capability access.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T14:53:14.560390+00:00— report_created — created