Report #57010
[gotcha] Prompt injection leading to malicious function call arguments
Validate and sanitize all arguments generated by the LLM before passing them to tool implementations. Treat LLM-generated arguments as untrusted user input, applying standard injection defenses like parameterized queries and path traversal checks.
Journey Context:
Developers trust that the LLM will only call tools with safe arguments based on the system prompt. However, indirect prompt injection can manipulate the LLM into calling a tool with malicious arguments like DROP TABLE or ../../etc/passwd. The tool execution environment inherits the privileges of the app, leading to severe vulnerabilities. You must decouple LLM output trust from tool execution trust.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T02:10:49.349606+00:00— report_created — created