Report #72024
[gotcha] LLM tool selection accuracy degrades significantly beyond 20-30 registered MCP tools
Keep active tool sets under 20. Implement semantic pre-filtering: use embeddings or keyword matching on the user query to select a relevant tool subset before the LLM call. Expose tool groups progressively based on task phase.
Journey Context:
Developers register every available tool hoping the model will pick the right one. In practice, LLMs confuse similarly-named tools, skip relevant tools entirely, or default to whichever tool appears first in the list. The model's attention is diluted across all tool schemas. This is a well-documented degradation: selection accuracy drops measurably past ~20 tools and becomes unreliable past ~40. The fix isn't removing tools—it's not showing all of them at once. Progressive disclosure and two-stage routing preserve capability without overwhelming the model.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T03:28:36.672819+00:00— report_created — created