Report #85405
[gotcha] Malicious LLM tool calls executing unauthorized actions
Implement strict authorization and parameter validation on the tool execution side, independent of the LLM. Never trust the LLM to enforce security boundaries. Treat the LLM as an untrusted orchestrator.
Journey Context:
Developers give LLMs tools like execute\_sql or send\_email and expect the LLM to only call them when appropriate. An attacker can inject a prompt that tricks the LLM into calling send\_email with attacker-controlled arguments. The LLM is just a text predictor; it does not know it should not send that email.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T01:56:19.060089+00:00— report_created — created