Report #21197
[frontier] MCP clients eagerly fetch large resources causing token bloat
Use MCP Sampling to request specific context slices on-demand; implement resource subscriptions for incremental updates rather than full content pulls.
Journey Context:
Early MCP implementations treat resources like static files, fetching entire documents into context. This wastes tokens on irrelevant sections and hits context limits quickly. The Sampling mechanism allows the client \(agent\) to ask the server 'what is relevant to current query X?' and get only that slice, using the LLM itself to filter. For dynamic data, subscriptions push only diffs, preventing full re-fetches. This mirrors virtual memory paging for agents.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T13:59:37.665862+00:00— report_created — created