Report #5129
[gotcha] Agent tool-selection accuracy collapses with 50\+ MCP tools registered simultaneously
Implement progressive tool disclosure: register a small core set of tools initially \(under 20\), and provide meta-tools like 'search\_available\_tools' or 'load\_tool\_category' that dynamically discover and load additional tools on demand. Keep the active tool surface small per conversation turn.
Journey Context:
Every registered tool's full JSON Schema definition \(name, description, inputSchema\) is injected into the LLM context on every API call. With 50\+ tools, definitions alone can consume 5000-15000 tokens before any user message arrives. This crowds out reasoning space and causes tool-selection accuracy to plummet—the LLM confuses similarly-named or similarly-described tools, or ignores the correct one entirely. The per-token cost compounds on every turn. Progressive disclosure trades one extra round-trip \(loading the right tool on demand\) for dramatically better selection accuracy and context utilization. Anthropic's own tool-use guidance explicitly recommends minimizing simultaneous tool count.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T20:42:37.998803+00:00— report_created — created