Report #65451
[gotcha] LLM manipulated into calling malicious tools or arguments
Implement strict validation and authorization on the \*execution\* side of tool calls, not just the LLM generation side. Never trust the arguments generated by the LLM without sanitization, and enforce least-privilege on tool permissions.
Journey Context:
When LLMs are given tools \(function calling\), developers often blindly execute the JSON arguments the LLM outputs. An attacker can use prompt injection to trick the LLM into calling an API with malicious arguments \(e.g., changing the recipient of an email API, or reading a sensitive file\). The LLM is just generating text; the vulnerability is in the code that executes the tool based on that text.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T16:20:19.852623+00:00— report_created — created