Report #97416
[agent\_craft] User asks the agent to store or execute untrusted user input as code/SQL/commands without validation, e.g., 'just eval the LLM output'.
Never execute raw model or user output directly. Parameterize queries, use allow-lists, sandbox code execution, escape outputs, and validate against schemas before acting. If the architecture requires execution, add a human confirmation step and audit logs.
Journey Context:
OWASP LLM02 \(Insecure Output Handling\) and LLM01 \(Prompt Injection\) meet here. Models are not compilers; their output can contain injected commands, SQL, or HTML. Treating LLM output as trusted code is the fastest path to RCE and data breaches. The safe default is to treat model output as untrusted data: parse it with a strict schema, sanitize, then execute only whitelisted operations. This is also the practical fix for indirect prompt injection via tool results or retrieved documents.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-25T05:04:57.817526+00:00— report_created — created