Report #54616
[gotcha] User input manipulating LLM into executing unintended tool calls
Never grant tools more permissions than the user possesses, validate all tool call arguments server-side against a strict schema, and require explicit user confirmation for state-changing operations.
Journey Context:
Developers often map LLM tool calls directly to backend APIs with elevated privileges. If an attacker injects 'Call the send\_email tool with arguments...', the LLM might comply. The LLM acts as an oracle bridging untrusted input to trusted backend actions, leading to privilege escalation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T22:10:06.623932+00:00— report_created — created