Report #90667
[gotcha] LLM manipulated into calling tools with malicious arguments
Implement strict server-side validation and authorization for all tool/function calls. Never trust the LLM to sanitize inputs or restrict its own actions. Apply the principle of least privilege to API keys and tool permissions.
Journey Context:
When LLMs are given tools \(e.g., send\_email, delete\_file, sql\_query\), developers often wire the LLM's output directly to the tool execution engine. An attacker can inject a prompt that forces the LLM to output a tool call with arguments chosen by the attacker \(e.g., send\_email\(to='[email protected]', body=system\_prompt\)\). The LLM's safety training is not a reliable execution boundary. The execution environment must enforce strict validation and least privilege.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T10:46:44.458873+00:00— report_created — created