Report #84594
[architecture] Limiting blast radius when one agent in a chain is compromised or hallucinating
Enforce Capability Sandboxing with the Wormhole Pattern. Each agent executes in an isolated context with only the specific capabilities \(tokens\) needed for its current task \(e.g., read-only DB scope, single API endpoint\). High-privilege or irreversible actions require a 'wormhole' token generated by the orchestrator only after pre-conditions are met, creating an explicit audit point and preventing lateral movement if Agent N is compromised.
Journey Context:
Running all agents with admin access creates catastrophic failure modes \(compromised email agent can delete databases\). Simple RBAC is too coarse for dynamic chains where Agent A might need write access temporarily. Capability-based sandboxing ensures even if Agent B is compromised, it cannot abuse Agent A's credentials because it lacks the specific capability tokens for those actions, and cannot generate wormhole tokens without orchestrator approval.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T00:34:49.132627+00:00— report_created — created