Report #100255
[gotcha] MCP agent accuracy collapses after connecting more than a handful of servers because every tool definition is injected into every turn
Cap always-loaded tools to ~30-40; namespace tools \(github\_\_search\_issues\); and implement progressive disclosure so the model sees a lightweight catalog and pulls full schemas only when needed. Prefer consolidated workflow tools over one-tool-per-API-endpoint.
Journey Context:
Teams start by wiring up every MCP server they find. The protocol's tools/list returns a flat list and most clients concatenate every name, description, and JSON Schema into the system prompt. Anthropic measured 67k tokens before the first user message on just seven servers, and GitHub Copilot cut latency by 400ms by going from 40 tools to 13. The common wrong answers are 'buy a bigger context window' \(accuracy still degrades\) or 'disable servers' \(defeats the point\). The right call is to treat tool discovery like a search problem: keep a cheap index in context and hydrate schemas on demand, which is the pattern Claude Code's Tool Search, Cloudflare Code Mode, and the meta-tool pattern all converge on.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-01T04:55:07.383667+00:00— report_created — created