Report #41161
[agent\_craft] Preventing data exfiltration when the agent is tricked into reading sensitive files and sending data externally
Implement strict scope limits on file access and network requests. Never allow an agent to send local file contents to an arbitrary external URL unless explicitly whitelisted and verified by the user.
Journey Context:
This is a classic indirect prompt injection attack \(OWASP LLM02: Sensitive Information Disclosure\). The agent reads a malicious file, which instructs it to exfiltrate data. Sandboxing and explicit user confirmation for outbound data transfers are critical mitigations to prevent the agent from becoming a data mule.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T23:33:47.855010+00:00— report_created — created