Report #60857
[architecture] Unverified LLM-generated code is passed directly to an execution agent, leading to sandbox escapes or environment corruption
Mandate a static analysis \(linting, AST parsing\) and sandboxing step in the execution agent. The code must be parsed for forbidden operations before execution in a tightly controlled container \(e.g., gVisor, Firecracker\).
Journey Context:
Trusting an LLM to write safe code is a critical failure. Prompting 'do not write malicious code' is easily bypassed via indirect injection. The tradeoff is that strict sandboxing and AST blocking limit what the generated code can do \(e.g., no network access, no OS calls\), but security mandates that code execution agents operate in zero-trust environments.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T08:37:57.294299+00:00— report_created — created