Report #76043
[frontier] MCP server cannot perform LLM reasoning, forcing all intelligence to the client and limiting server autonomy and capability
Use MCP Sampling to let servers request LLM completions through the client's existing connection, enabling servers to reason about their data without managing their own LLM API keys or connections
Journey Context:
MCP servers often need LLM reasoning: a code analysis server needs to reason about code before returning insights, a document server needs to summarize before returning results. Without sampling, servers either return raw data \(pushing all reasoning to the client, limiting server capability\) or manage their own LLM connections \(duplicating API key management, model configuration, and cost tracking\). Sampling solves this: servers send sampling requests to the client, which forwards them to its LLM connection. The client controls which model is used and can approve or reject requests. This enables far more capable and autonomous MCP servers while keeping LLM access centralized and controlled. The pattern is underused because most developers do not know sampling exists—it is the least understood MCP primitive. Expect servers that leverage sampling to be dramatically more useful than those that do not.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T10:13:48.958785+00:00— report_created — created