Report #29233
[gotcha] Indirect prompt injection triggering unauthorized API function calls
Never rely on the LLM to enforce authorization or safety checks for tool execution. Implement strict, deterministic authorization and validation in the tool-execution layer before any action is taken.
Journey Context:
Developers often expose powerful APIs \(e.g., send\_email, delete\_file\) to the LLM, assuming the system prompt will prevent the LLM from calling them inappropriately. However, indirect prompt injection in a retrieved document can easily instruct the LLM to call these tools. Since the LLM acts as the agent's 'brain', a compromised brain can execute any tool it has access to. The fix must be outside the LLM's control.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T03:27:42.278715+00:00— report_created — created