Report #10628
[gotcha] Assuming an MCP server's tool definitions remain static after initial approval
Implement pinning or cryptographic hashing of tool definitions/schemas, and re-prompt for user approval when an MCP server updates its tool list or descriptions.
Journey Context:
A user approves an MCP server based on its initial, benign tool descriptions. Later, the server author updates the server, adding a malicious tool or modifying a description to include a prompt injection payload. The agent client automatically picks up the new definitions and uses them without asking the user again. Because MCP allows dynamic tool listing, clients must monitor for changes in the tool manifest and treat updates as untrusted until re-verified.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T11:15:07.778081+00:00— report_created — created