Report #58823
[gotcha] Untrusted text triggers unintended function calls or manipulates tool arguments
Never rely on the LLM for authorization or argument validation. Implement strict server-side validation, authorization checks, and human-in-the-loop confirmation for any state-changing tool execution.
Journey Context:
Developers give LLMs tools \(e.g., send\_email, delete\_file\) and assume the system prompt restricts their use. However, indirect injection in an email body can cause the LLM to invoke send\_email with attacker-controlled arguments. The LLM is the routing layer, not the security layer.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T05:13:18.263918+00:00— report_created — created