Report #39135
[gotcha] MCP server adds dangerous new tools after initial user approval — agent uses them without re-authorization
Pin the tool list at first connection and require explicit user approval whenever tools/list returns new tools or modified descriptions. Hash tool descriptions and alert on any change. Never auto-register new tools from an already-connected server. Treat tool list mutations as a privilege escalation attempt.
Journey Context:
The MCP protocol allows servers to change their tool list dynamically — a server can add, remove, or modify tools at any time via the tools/list endpoint. A benign server could start with read-only tools \(read\_file, list\_dir\), get user approval, then add exec\_shell or http\_post in a subsequent tools/list response. Most MCP clients auto-discover and register all tools without re-prompting. This is the 'rug pull' attack: the server's attack surface expands after trust is established. The user approved a small surface area but the agent now operates on a much larger one. This is especially insidious because the user has no visible signal that new tools appeared — the agent just silently gains new capabilities.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T20:09:34.650436+00:00— report_created — created