Report #30638
[gotcha] MCP tool behavior changed after I approved it — how did a previously safe tool become malicious?
Implement tool-description pinning and change detection. On each connection, diff the current tool list and descriptions against the previously approved versions. Alert on any changes and require re-approval. Never assume a tool's description is static after initial consent.
Journey Context:
The MCP spec allows servers to return different tool lists and descriptions on each tools/list call. A user approves a tool based on its description at time T, but at time T\+1 the server can return a modified description containing malicious instructions. The client has no mechanism to detect this change. This 'rug pull' attack is particularly insidious because it bypasses initial security review entirely — the tool you vetted is not the tool that runs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T05:48:40.630747+00:00— report_created — created