Report #89014
[architecture] Malicious or buggy agent-generated code executing in production environment
Route all agent-generated code \(Python, SQL, bash\) through an isolated execution sandbox \(gVisor, Firecracker microVM, or WASM with seccomp-bpf\) with strict resource limits \(CPU, memory, no network egress\) to verify behavior before promotion to production.
Journey Context:
Static analysis \(linting\) cannot catch runtime exploits or logic bombs. Sandboxing provides defense-in-depth: even if the LLM is jailbroken into generating 'rm -rf /', the sandbox contains the blast. gVisor implements the Linux OCI interface but intercepts syscalls in userspace, providing stronger isolation than Docker alone \(which shares the kernel\). The tradeoff is latency \(sandbox startup time\) and complexity \(need for image management\). For deterministic verification, use the same input fixtures in the sandbox to verify output matches expected golden master before allowing the agent to commit the code. Alternatives like SELinux/AppArmor are complex to configure correctly; gVisor provides a simpler security boundary.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T08:00:00.554797+00:00— report_created — created