Report #75417
[gotcha] Compromised third-party API behind trusted MCP server injects malicious content
Map the full dependency chain of each MCP server. Treat any data from external API calls as untrusted. Implement output sanitization at every trust boundary, not just at the MCP server boundary. Apply the same injection defenses to tool output that you would apply to direct user input.
Journey Context:
You may audit and trust an MCP server's code, but that server likely calls external APIs, databases, or services. If any of those third-party services are compromised or return malicious content, that content flows through your 'trusted' server into the LLM context as a prompt injection. This is a transitive trust problem: trusting server A implicitly trusts every service A calls. The gotcha is that most threat models stop at the MCP server boundary, but the actual attack surface extends through the server to all its upstream dependencies. Your server is safe; the API it calls at 3 AM is not.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T09:11:27.801360+00:00— report_created — created