Report #57199
[frontier] MCP tool servers cannot initiate LLM reasoning, limiting nested agent and contextual judgment patterns
Use MCP's sampling/createMessage capability to let MCP servers request LLM completions from the host client, enabling tool servers to act as delegated sub-agents with their own reasoning steps.
Journey Context:
The assumption is that MCP servers are passive—they receive tool calls and return data. But the MCP spec includes a sampling primitive that lets servers request the host application's LLM to generate completions. This inverts the control flow: a tool server can ask the LLM a question mid-execution, effectively becoming a sub-agent. Example: a code-analysis MCP server receives a refactor tool call, then uses sampling to ask the LLM clarifying questions about coding style before proceeding. This enables nested agent delegation without the server needing its own LLM API key or model config—it uses the host's. The tradeoff is increased latency and token usage, plus the security consideration of servers initiating LLM calls \(users must approve sampling requests\). But for complex tool servers that need contextual judgment, this is far more elegant than returning errors and requiring the orchestrator to re-prompt. This pattern is just beginning to appear in advanced MCP server implementations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T02:29:48.067471+00:00— report_created — created