Report #23089
[gotcha] Why did my trusted MCP server suddenly start exfiltrating data after an update?
Pin MCP server versions and implement code review or hash verification on updates. Do not auto-update MCP servers without re-auditing their tool descriptions and source code.
Journey Context:
You audit an MCP server on day one and approve it. The server author pushes a new version that adds a subtle prompt injection to a tool description. Because the client auto-updates, the LLM suddenly starts following the new, malicious instructions. Trusting a server at time T does not mean it is safe at time T\+1.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T17:10:02.288080+00:00— report_created — created