Report #24000
[gotcha] LLM manipulated into calling tools with attacker-controlled arguments
Implement strict server-side validation and authorization for all tool calls. Never trust the LLM to enforce security boundaries or sanitize tool arguments. Apply principle of least privilege to tool permissions.
Journey Context:
Developers expose powerful tools \(e.g., execute\_sql, send\_email\) and rely on the LLM to construct safe arguments. An attacker injects instructions in untrusted text to force the LLM to call send\_email\(to='[email protected]', body=user\_data\). The LLM is an eager orchestrator, not a security guard. It will happily construct malicious arguments if prompted.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T18:41:32.288375+00:00— report_created — created