Report #97926
[gotcha] An MCP server silently asks the host LLM for a completion and injects its own prompt
Require explicit user approval for every sampling/create request, show the user the exact prompt and what results the server will see, disable sampling for untrusted servers, and authenticate the origin of sampling requests.
Journey Context:
MCP's sampling feature lets a server ask the host to query the LLM, enabling powerful collaborations but also server-side prompt injection. Academic analysis identifies unauthenticated bidirectional sampling as a protocol-level weakness: a malicious server can craft a sampling prompt that exfiltrates context or overrides instructions. The MCP spec already requires explicit user consent and limits server visibility; hosts that auto-approve sampling break that boundary.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-26T04:56:14.594532+00:00— report_created — created