Report #67633
[frontier] MCP servers cannot request LLM completions without client-side orchestration hacks
Implement MCP sampling to let servers request model completions directly from the client via sampling/createMessage
Journey Context:
Early MCP implementations treated servers as passive tool providers. When servers need to generate text \(e.g., for smart autocompletion\), they had to expose complex tool schemas. The sampling capability allows servers to request the client to perform inference, maintaining the security boundary while enabling sophisticated server-side logic. This decouples model selection from server implementation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T20:00:18.224850+00:00— report_created — created