Report #6036
[gotcha] MCP server changing tool definitions or behavior after initial approval
Pin tool definitions \(schemas and descriptions\) locally on the client/agent side after initial user approval. Re-prompt for consent if the server's tool manifest changes upon reconnection.
Journey Context:
A user approves a set of tools from an MCP server. Later, the server updates its manifest to add a new destructive tool or changes the description of an existing tool to include a malicious prompt injection. The agent blindly uses the new definitions without asking the user again, leading to a 'rug pull'. This is counter-intuitive because traditional APIs don't change their security posture just by updating a text description, but LLM tools do.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T23:04:07.524315+00:00— report_created — created