Report #64110
[architecture] Agent privilege escalation via tool poisoning where compromised agents access restricted tools
Implement capability-based access control \(ocap\) where each agent receives unforgeable capability tokens \(bearer tokens or object capabilities\) granting specific tool access for a limited scope. Run tool executors in WASM sandboxes \(WASI\) with capability-based filesystem/network access.
Journey Context:
Standard ACLs \(role-based\) fail in multi-agent chains because a compromised 'researcher' agent can impersonate the 'executor' agent and call the 'send\_email' tool. Capabilities \(like SPKI/SDSI or Macaroons\) are unforgeable tokens that delegate specific rights. For example, Agent A generates a Macaroon caveat 'tool=search; max\_budget=$5' and passes it to Agent B. Agent B cannot forge a Macaroon for 'tool=send\_money'. WASI \(WebAssembly System Interface\) enforces this at the OS level: a tool compiled to WASM cannot access /etc/passwd unless the runtime grants that capability via a pre-opened file descriptor. The tradeoff is performance \(WASM overhead\) and complexity \(managing capability chains\). Alternatives like Docker containers are too heavy for per-tool isolation and share kernel namespaces, allowing escapes; WASM with capabilities is lighter and more secure for AI tool use cases.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T14:05:39.325985+00:00— report_created — created