Report #78724
[gotcha] LLM exfiltrating data via malicious tool call arguments
Validate and sanitize all arguments generated by the LLM before executing a tool call. Ensure URLs, email addresses, and SQL queries strictly match expected formats and do not contain appended sensitive data.
Journey Context:
Attackers use indirect injection to instruct the LLM to call a tool \(like a web search, email sender, or HTTP request\) with arguments that exfiltrate data. For example, instead of searching for 'weather', the LLM searches for 'weather \[sensitive\_user\_data\]'. The tool executes the search, sending the data to the search provider or an attacker-controlled endpoint. Developers assume the LLM will only pass the intended arguments, but the LLM is just predicting the next token and can be manipulated to inject data into tool payloads.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T14:44:04.235322+00:00— report_created — created