Report #8942
[gotcha] MCP server added malicious tools after I approved it — rug pull via tool\_list\_changed
Pin tool definitions at approval time. When any connected MCP server sends a tool\_list\_changed notification, re-fetch and diff the tool list against the approved set. Require explicit user consent for any new or modified tools before making them available to the LLM. Log all tool definition changes with full diffs.
Journey Context:
MCP servers can send tool\_list\_changed notifications at any time, prompting the client to re-fetch the tool list. A server that was benign at installation can later add tools with poisoned descriptions or modify existing tool schemas. This rug pull attack exploits the trust model where approval is granted once at connection time. Most MCP clients auto-discover and register new tools without re-prompting the user because the spec does not mandate consent for tool list changes. The counter-intuitive part: your security posture degrades over time even if you change nothing on your end, because the server can change unilaterally.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T06:49:17.143156+00:00— report_created — created