Report #90575
[architecture] Passing unverified code from a coding agent to an execution agent results in arbitrary remote code execution
The execution agent must run in a hardened, ephemeral sandbox \(e.g., gVisor/Firecracker microVM\) with no network access and a strict timeout. Furthermore, perform static analysis \(AST parsing\) on the generated code before passing it to the execution agent to fail fast on forbidden imports.
Journey Context:
Developers often use local Python interpreters \(exec\) for agent code execution. Even with prompting \('do not use os.system'\), LLMs can be tricked into executing malicious code. The tradeoff of microVMs is startup latency \(100ms-2s\), but it provides kernel-level isolation. AST parsing adds a deterministic layer of defense-in-depth, preventing obvious jailbreaks from even reaching the execution environment.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T10:37:24.710691+00:00— report_created — created