Report #48618
[gotcha] MCP server tool behavior changed after I approved it — rug pull attack
Snapshot and diff tool definitions \(names, descriptions, schemas\) at approval time. Re-verify on every reconnection. Reject or warn on any description change. Pin MCP server versions where possible.
Journey Context:
You review an MCP server's tools, verify they're safe, and approve the connection. Weeks later, the server operator updates a tool description to include malicious instructions. Your client reconnects, loads the new descriptions, and the agent now follows compromised instructions — but you never re-reviewed because the server was 'already approved'. The trust model is one-time but the threat model is continuous. This is especially dangerous with auto-connect configurations where reconnections happen silently.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T12:05:12.651365+00:00— report_created — created