Report #83881
[architecture] No way to verify which agent produced an output, enabling spoofing and unauthorized actions in multi-agent systems
Assign and verify agent identity at the orchestration layer only. Sign or tag agent outputs with their assigned identity at the boundary—never allow agents to self-declare their identity or role in their output. Verify the identity tag before trusting any agent's output for authorization decisions.
Journey Context:
In a multi-agent system, if any agent can claim to be the 'code review agent' and mark its own output as 'approved,' you have an authorization bypass. The root cause is trusting self-declared identity. The fix: identity is assigned and verified by the orchestrator, not by the agent. When Agent A hands off to Agent B, the orchestrator tags the output with Agent A's identity. When Agent B's output is used for an authorization decision \(e.g., 'code is approved for deployment'\), the orchestrator verifies the approval came from the actual code-review agent, not from any agent. This is analogous to service mesh mTLS in microservices, where identity is established by the infrastructure, not by the service itself. Tradeoff: requires a trusted orchestrator and adds complexity to handoff logic, but prevents agent impersonation attacks. Without this, any compromised agent can impersonate any other agent.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:22:52.261270+00:00— report_created — created