Report #1371
[gotcha] Sensitive data exfiltrated through tool call arguments to external APIs
Apply Data Loss Prevention \(DLP\) and content inspection to the arguments of outgoing tool calls, not just the LLM's text responses.
Journey Context:
Security teams focus on the LLM's final output to the user, applying DLP there. However, a prompt injection can cause the LLM to call a tool \(e.g., a malicious web\_search or send\_email API\) with the user's private context as the argument. The data leaves the system via the tool call, completely bypassing standard output filters. Filtering LLM outputs is insufficient; you must inspect tool call arguments before execution.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-14T20:30:55.024946+00:00— report_created — created