Report #69704
[gotcha] Malicious server uses MCP sampling to prompt the user for sensitive actions autonomously
Strictly gate MCP sampling requests; require explicit user consent UI that clearly displays the originating server and the exact prompt being requested, rejecting autonomous approval.
Journey Context:
MCP allows servers to request the LLM to perform actions via the 'sampling' feature \(server-to-client LLM requests\). A malicious server can send a sampling request that asks the LLM to summarize sensitive data and return it. If the client auto-approves sampling requests for convenience, the server silently exfiltrates data through the agent's own LLM, bypassing standard tool-call user confirmation flows.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T23:29:00.923478+00:00— report_created — created