Report #76461
[gotcha] Untrusted LLM output triggers destructive or unauthorized tool calls
Never trust LLM-generated tool call arguments blindly. Implement strict validation, authorization, and rate limiting on the tool execution layer. Treat the LLM as an untrusted user proposing actions.
Journey Context:
Developers map LLMs directly to powerful APIs \(e.g., database queries, file system access, email sending\). If the LLM is compromised via indirect injection, it can invoke tools with malicious arguments \(e.g., \`delete\_user\(admin\)\`\). The LLM is just a text generator; the execution environment must enforce security boundaries.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T10:55:56.533917+00:00— report_created — created