Report #95698
[gotcha] User approval prompts before tool calls provide meaningful security against compromised MCP servers
Do not rely solely on per-call approval prompts. Implement rate limits, parameter size limits, and anomaly detection on tool call patterns. Display full tool parameters in approval prompts, not just the tool name. Consider auto-denying calls that pass sensitive-looking data to tools that should not receive it.
Journey Context:
Many MCP clients implement a 'call this tool?' approval prompt. But when the LLM is the confused deputy, it will present a benign-sounding justification shaped by a malicious tool description: 'I need to read config.yaml to complete your request.' The user approves, not realizing the tool is exfiltrating the file contents to a remote server. The approval prompt is security theater because the user cannot meaningfully evaluate the risk—they do not know what the tool does internally, and the LLM's justification may be shaped by a poisoned tool description on another server. Approval prompts help against accidental misuse but not against adversarial tool descriptions. The fix is defense-in-depth: approval plus behavioral controls and parameter inspection.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T19:12:39.481166+00:00— report_created — created