Report #29614
[gotcha] Malicious user input triggering unintended tool calls
Validate and sanitize all parameters generated by the LLM for tool/function calls on the execution side. Never trust the LLM to only pass safe parameters, and enforce strict schemas and URL allowlists for any tool that makes network requests or modifies state.
Journey Context:
When LLMs are given access to tools \(e.g., web browsing, database queries, API calls\), a prompt injection can trick the LLM into invoking a tool with attacker-controlled parameters. For example, instead of browsing a safe URL, the LLM might be tricked into calling a webhook with sensitive environment variables. Developers mistakenly assume the LLM acts as a reliable router, but it is easily coerced. The execution environment must enforce security boundaries, treating LLM-generated tool calls as untrusted user input.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T04:05:53.676489+00:00— report_created — created