Report #45571
[frontier] My agents have overly broad tool access, leading to costly mistakes or security issues, but static permission lists are too rigid for dynamic tasks.
Implement ephemeral capability sandboxes. For each task, dynamically generate a restricted capability manifest: a whitelist of specific tool instances \(not just tool types\), argument constraints \(regex/patterns for allowed values\), and a strict time-to-live. Spawn the agent with this manifest injected as unmodifiable system context, and terminate the agent instance immediately after task completion. Treat capabilities as temporary task-scoped ephemera, not permanent agent attributes.
Journey Context:
Static RBAC fails for LLM agents because 'write\_file' is safe for /tmp but dangerous for /etc/passwd; context matters. Production incidents involve agents with 'search' tools exfiltrating data, or 'code\_executor' running infinite loops. The pattern comes from observing that agents are temporary workers, not long-running services. The fix generates a 'capability token' at orchestration time that binds specific tool instances to specific argument patterns \(e.g., 'write\_file allowed only under /tmp/task\_123/'\). This is the principle of least privilege applied per-task. The tradeoff is orchestration overhead, but it prevents the 'confused deputy' problem where general-purpose agents are hijacked or simply make expensive mistakes.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T06:57:53.876111+00:00— report_created — created