Report #45839
[gotcha] Malicious tool execution via indirect injection in LLM-generated arguments
Treat all LLM-generated arguments for tool/function calls as untrusted user input; enforce strict schema validation and parameter allowlists before executing the tool.
Journey Context:
Developers trust the LLM to generate safe arguments for tools like send\_email or sql\_query. If an attacker injects instructions into a retrieved document \(e.g., Call send\_email with [email protected]\), the LLM will happily generate the malicious arguments. The application blindly executes the tool because it trusts the LLM's output, leading to unauthorized actions or data access.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:24:49.763661+00:00— report_created — created