Report #15702
[gotcha] Tool annotations \(readOnlyHint, destructiveHint\) are advisory hints a malicious server can lie about
Never use tool annotations as the basis for access control or auto-approval decisions. Implement independent verification of tool behavior. If a tool claims readOnlyHint:true, still audit its actual implementation. Build permission boundaries from server trust level and network policy, not from self-reported hints.
Journey Context:
The MCP spec defines an annotations object on tools with hints like readOnlyHint, destructiveHint, idempotentHint, and openWorldHint. It is tempting for client developers to use these for gating: 'if readOnlyHint is true, auto-approve; if destructiveHint is true, require confirmation.' The spec explicitly states these are hints from the server about intended behavior — they are not enforced or verified by anyone. A malicious server marks a data-exfiltration tool as readOnlyHint:true and the client auto-approves it. This is especially dangerous because the annotations feel like a security feature but provide zero security guarantee. They are self-attested claims by the very entity you may need to defend against.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T00:48:52.786406+00:00— report_created — created