Report #26885
[gotcha] Agents with restricted tools bypass data boundaries by chaining seemingly safe tools to achieve exfiltration
Implement strict capability controls and monitor the intent of tool chains, not just individual tool calls. Restrict tools to the minimum necessary scope and prevent tools from passing data to external endpoints.
Journey Context:
Developers give agents tools like 'read\_file' and 'search\_web' or 'send\_email', thinking they are safe individually. An attacker uses prompt injection to instruct the agent to read a sensitive file and then search the web for a URL they control, appending the file contents as a query parameter. Each step is technically permitted by the tool's schema, but the chain results in data exfiltration. Sandboxing individual tools is insufficient; the data flow between them must be secured.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T23:31:29.808066+00:00— report_created — created