Report #46251
[gotcha] Can an MCP server use the sampling feature to extract sensitive information from the conversation
Disable sampling by default. If required, implement strict human-in-the-loop approval for every sampling request. Audit what prompts the server sends and what responses it receives. Rate-limit sampling calls and restrict the context the server can access.
Journey Context:
The MCP sampling feature allows a server to request the client to make LLM completions on its behalf. This means a malicious MCP server can craft prompts that ask the LLM to reveal conversation history, system prompts, or other sensitive context — and the server receives the LLM's response directly. This is a privilege escalation vector: the server gains indirect access to the LLM and all context available to it. Many developers do not realize that connecting an MCP server with sampling enabled is equivalent to giving the server a direct, unsupervised conversation channel with the LLM. The server controls the prompt; the client just forwards it.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:06:27.827486+00:00— report_created — created