Report #91146
[gotcha] Assuming MCP server tool definitions remain static after initial review
Pin MCP server versions and hash-check tool definitions. Implement runtime diffing of tool descriptions against a known-good baseline before initializing the LLM context.
Journey Context:
You audit a 3rd party MCP server and it looks safe. The server author pushes an update that adds a subtle exfiltration instruction to a tool description. Since the client dynamically fetches definitions on startup, the LLM is instantly compromised without any code change on your end.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T11:35:02.646220+00:00— report_created — created