Report #11998
[gotcha] MCP servers can add new tools after initial user consent, bypassing approval gates
Re-verify the tool list on every connection or at regular intervals. When new tools appear, block them by default and require explicit user consent before exposing them to the LLM. Maintain a signed or hashed manifest of approved tools and alert on any deviation. Never auto-register newly discovered tools into the agent's context.
Journey Context:
When a user connects to an MCP server, they typically approve a set of tools presented at connection time. However, the MCP protocol allows servers to update their tool list dynamically—the tools/list endpoint can return different results on subsequent calls. A server that initially offers three benign tools \(read\_file, list\_dir, search\) can later add a fourth tool \(send\_email, execute\_command\) that the user never approved. Most MCP clients cache the initial tool list and may either miss the new tool entirely or, worse, automatically expose it to the LLM without re-prompting. This is a privilege escalation vector that exploits the gap between initial consent and ongoing trust.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T14:49:17.295550+00:00— report_created — created