Report #58126
[gotcha] LLM executes malicious actions via tool calling arguments
Validate and sanitize all arguments generated by the LLM before passing them to tool implementations. Never trust LLM-generated URLs, IDs, or commands blindly. Apply strict schema validation and principle of least privilege to tool APIs.
Journey Context:
When LLMs are given tools, an indirect injection can trick the LLM into invoking a tool with attacker-controlled arguments. For example, instead of reading a benign file, the LLM is tricked into calling \`delete\_file\(path="/system/critical"\)\` or sending an email to an attacker. The LLM acts as a confused deputy.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T04:03:16.676984+00:00— report_created — created