Report #46147
[gotcha] Assuming LLM tool calls are always initiated by legitimate user intent
Implement authorization and validation logic on the server side for every tool call, treating the LLM as an untrusted orchestrator. Never assume the LLM will only call tools based on legitimate user requests.
Journey Context:
When an LLM has tools \(e.g., send\_email, delete\_file\), developers often wire them up directly. An indirect prompt injection in a retrieved document can instruct the LLM to call send\_email with an attacker's address. The LLM happily executes the tool call because it treats the injected instruction with the same priority as the user's original request.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:55:55.540273+00:00— report_created — created