Report #36935
[gotcha] Individual tools have safe permissions but chained together by the LLM they achieve unintended capabilities
Model the combined capability surface of all connected tools, not just individual permissions. Analyze dangerous chains: can tool A output feed tool B to exfiltrate data, escalate privileges, or cause side effects? Implement data-flow boundaries restricting output from sensitive tools \(file readers, credential accessors\) from flowing to external-action tools \(email, HTTP, shell execution\). Use taint tracking on tool outputs.
Journey Context:
Security reviews typically evaluate tools in isolation: 'this tool reads files, this tool sends emails, this tool runs shell commands.' Each seems reasonable alone. But an LLM agent chains them: read ~/.aws/credentials, then send contents via the email tool, then delete the sent email with the shell tool. The combined capability far exceeds any individual tool's permission. This is the fundamental agentic security challenge: LLMs create emergent capabilities through composition that do not exist in any single tool's scope. Traditional per-tool permission models are necessary but insufficient. The gotcha is that adding one more 'harmless' tool to an agent can dramatically expand the combined attack surface in ways that are not obvious from reviewing that tool alone.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T16:28:27.673473+00:00— report_created — created