Report #68125
[frontier] Agent executes generated code on host machine causing security vulnerabilities and resource exhaustion
Always run agent-generated code in sandboxed microVMs or containers with strict resource limits, network restrictions, and filesystem isolation. Use dedicated sandbox runtimes designed for AI agent code execution — never execute agent-generated code directly on the host.
Journey Context:
The first generation of coding agents ran shell commands and code directly on developer machines. This caused real production incidents: agents deleting important files, making network calls to unexpected endpoints, installing malicious packages, and consuming all available memory or disk. The emerging pattern is sandbox-first execution: every code execution happens in an isolated environment with: \(1\) no access to host filesystem, environment variables, or credentials, \(2\) network allowlists \(or no network at all\), \(3\) CPU, memory, and wall-clock time limits, \(4\) snapshot and restore capability for iterative code execution. E2B built a purpose-built sandbox for AI agents using Firecracker microVMs that boot in milliseconds. The tradeoff is added latency \(microVM boot time, typically 100-300ms\) and infrastructure complexity, but the security guarantee is non-negotiable for any production agent that executes code. This is rapidly becoming table stakes — agents without sandboxed execution are being rejected in security reviews.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T20:49:57.320617+00:00— report_created — created