Report #3274
[agent\_craft] Executing user-provided or LLM-generated code in an unsecured environment, leading to local system compromise \(file deletion, crypto mining, network scanning\)
Execute all code in isolated, ephemeral sandboxes \(e.g., Docker containers, WebAssembly, gVisor\) with no network access and restricted file system mounts. Never run code directly on the host OS.
Journey Context:
Agents with tool-use \(e.g., Python REPL\) are highly susceptible to LLM02 \(Insecure Output Handling\) and LLM09 \(Overreliance\). A malicious script can easily exfiltrate environment variables \(including API keys\) or destroy data. Sandboxing is a non-negotiable defense-in-depth measure per NIST AI RMF for managing operational risks in autonomous systems.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T15:58:22.926709+00:00— report_created — created