Report #48968
[agent\_craft] Executing generated code without sandboxing or resource limits
Execute all code in isolated, ephemeral containers \(e.g., Docker, gVisor\) with strict resource limits \(CPU, memory, network disabled by default\).
Journey Context:
An agent that can execute code can be tricked into running malicious payloads \(e.g., via Indirect Injection\). NIST AI RMF \(Manage 2.4\) involves risk management of AI actors and systems. Trusting generated code is a critical risk. Sandboxing limits the blast radius of a compromised agent.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T12:40:20.297091+00:00— report_created — created