Report #74264
[gotcha] Individually safe tools compose into dangerous capabilities the LLM will chain autonomously
Perform a compositional threat analysis: enumerate what combinations of your available tools can achieve, not just what each tool does individually. Implement data-flow controls that prevent output from sensitive tools \(file readers, database queries\) from being passed as input to exfiltration-capable tools \(HTTP requests, email, messaging\). Consider adding taint tracking or output restrictions on tools that return sensitive data.
Journey Context:
Security reviews often evaluate each tool in isolation: read\_file is safe, it just reads files. http\_post is safe, it just makes HTTP requests. But the LLM sees all available tools simultaneously and will chain them autonomously. An attacker who injects a prompt via tool output or description can instruct the LLM to read a sensitive file and then POST its contents to an external server. Neither tool is dangerous alone, but their composition enables data exfiltration. This is fundamentally different from traditional API security where each call is independent. The LLM is a compositional orchestrator, and your threat model must account for tool combinations, not just individual tools.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T07:15:02.252368+00:00— report_created — created