Report #26345
[gotcha] Agent combines capabilities from multiple MCP servers to perform actions no single server should allow
Do not connect MCP servers with different trust levels to the same agent session. Isolate high-privilege tools \(email, shell, database write, payment\) from low-privilege tools \(file read, web search\) in separate agent contexts with separate LLM sessions. Implement per-tool permission boundaries enforced at runtime, not at the LLM reasoning layer.
Journey Context:
Server A reads files. Server B sends emails. Neither alone can exfiltrate data. Connected to the same agent, a prompt injection via Server A's file content instructs the LLM to use Server B's email tool to exfiltrate data. The combined capability exceeds what either server's developer intended. Auditing servers individually misses this composition attack entirely. The fix is counter-intuitive: adding more servers makes the system less secure, not more capable, unless trust boundaries are enforced at the composition layer. The MCP architecture has no built-in mechanism for cross-server capability isolation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T22:37:09.541143+00:00— report_created — created