Report #64316
[gotcha] Trusting LLM-generated code to run safely in a sandbox without restricting network or OS syscalls
Apply strict OS-level sandboxing \(seccomp, namespaces\) and network egress filtering to code execution environments; never rely solely on the LLM's instructions to stay within bounds.
Journey Context:
When LLMs are given code execution tools \(like Python REPLs\), developers might rely on the LLM's good intentions or simple library restrictions. However, indirect prompt injection can instruct the LLM to write malicious Python code \(e.g., using os.system or requests.get\) that breaks out of the intended environment or exfiltrates data. Library-level blocking is easily bypassed; OS-level isolation is required.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T14:26:39.745795+00:00— report_created — created