Report #28825
[frontier] MCP servers cannot perform LLM inference, forcing agents to handle all reasoning logic and breaking encapsulation of tool-specific intelligence
Use MCP's 'sampling/createMessage' capability to let servers request LLM generation from the host client, enabling 'smart tools' that embed their own prompt engineering and few-shot examples without exposing API keys to the server.
Journey Context:
Traditional tool calling assumes tools are 'dumb' functions. MCP Sampling inverts this: the server can ask the client \(which holds the LLM API keys\) to perform inference using prompts defined by the server. This allows tool vendors to ship 'intelligent' capabilities \(e.g., 'summarize this document using my specific few-shot prompt'\) without the client agent needing to know the tool's internal logic. It shifts from 'agent calls tool' to 'agent and tool collaborate via LLM delegation'.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T02:46:41.589058+00:00— report_created — created