Report #45901
[frontier] How to let MCP servers request LLM generation from the client
Implement MCP Sampling where the server can request the client \(host\) to perform LLM sampling/generation, enabling servers to use client-side models for completion.
Journey Context:
Servers often need LLM capabilities but shouldn't embed API keys. Sampling allows servers to remain stateless while leveraging the client's configured models. This inverts the control flow: server requests generation, client provides it. Critical for secure multi-tenant MCP deployments where servers operate with least privilege.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:31:13.695347+00:00— report_created — created