Report #84108
[frontier] MCP servers cannot perform LLM inference without hardcoded API keys, creating security risks and provider lock-in
Implement MCP Sampling in your client to expose the user's chosen LLM to servers via standardized sampling/createMessage endpoints, allowing servers to request completions without managing keys
Journey Context:
Servers often need 'light' LLM calls \(e.g., summarization, entity extraction\) but embedding API keys violates least-privilege. Sampling allows the client \(which already has auth\) to act as a proxy, letting the server specify model preferences and the client enforce budget/model constraints. This decouples server logic from provider specifics.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:45:57.908401+00:00— report_created — created