Report #94572
[gotcha] Untrusted LLM output poisoning tool call parameters
Validate and sanitize all parameters passed to tool/function calls on the server side, just as you would validate user input in a web form. Never trust LLM-generated arguments implicitly, and enforce strict authorization boundaries.
Journey Context:
Developers assume the LLM will only call tools with safe, intended parameters. However, if the LLM processes untrusted text \(e.g., an email summarizer\), an attacker can inject instructions to call a tool \(like \`send\_email\`\) with malicious arguments. The LLM is just a text generator; if it outputs a valid JSON tool call, the orchestrator executes it. The orchestrator must enforce strict schemas and least-privilege authorization.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T17:19:22.325830+00:00— report_created — created