Report #88501
[frontier] MCP servers cannot reason or plan because they lack access to LLM completions without embedding API keys
Use MCP sampling to let servers request LLM completions from the host client via the sampling/createMessage endpoint, enabling server-side agentic behavior
Journey Context:
Early MCP servers were limited to deterministic tool execution. If a server needed to summarize a document before storing it, it had to embed its own API key \(security risk\) or return raw data to the client \(inefficient\). Sampling inverts the relationship: the server defines what it needs \(model, temperature, messages\) and the client fulfills it using its own credentials. This enables 'thick servers' that can chain reasoning steps while keeping API keys centralized. It replaces the tool-calling pattern with distributed agent composition.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T07:07:54.501127+00:00— report_created — created