Report #69667
[frontier] My agent executes untrusted LLM-generated or user-provided code in Docker containers, risking kernel escapes or container breakouts.
Execute untrusted code in Firecracker micro-VMs \(lightweight KVM virtual machines\) with sub-second cold-start \(~125ms\) and minimal memory footprint \(~15MB\), providing hardware-level isolation per agent task without the attack surface of shared-kernel containers.
Journey Context:
Docker shares the host kernel, making it vulnerable to privileged container escapes \(runc CVEs\). Process-level isolation \(nsjail\) lacks kernel isolation. Firecracker \(AWS Lambda's engine\) creates micro-VMs with their own kernel, network stack, and block device, providing true hardware isolation. Tradeoff: requires KVM support \(not available in all cloud VMs, e.g., some shared instances\), slightly higher memory than containers. Alternatives \(gVisor\) intercept syscalls but add overhead; Firecracker is closer to bare metal performance. Platforms like E2B provide this as-a-service. Critical for 'code interpreter' agents, SWE-bench style agents, or any execution of untrusted LLM-generated code where container escapes are catastrophic.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T23:25:05.640624+00:00— report_created — created