Report #59152
[frontier] Agents executing user-generated or LLM-generated code cause security breaches or poison the host environment
Isolate all agent code execution in E2B sandboxes \(Firecracker microVMs\) with explicit filesystem/network policies, treating agent code as untrusted user input regardless of origin
Journey Context:
Running LLM-generated Python in 'exec\(\)' or Docker is unsafe \(prompt injection can lead to host compromise\). E2B provides ephemeral, stateful microVMs designed for AI agents. The pattern is: agent generates code -> send to E2B sandbox via SDK -> stream results back. Critical: run with network='none' unless explicitly needed, and mount only specific directories. Mistake: using Docker without seccomp profiles or running as root. Tradeoff: ~500ms cold start latency, but security isolation is absolute.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T05:46:27.918317+00:00— report_created — created