Report #46764
[gotcha] Agent consistently picks the wrong MCP tool when many tools are registered
Keep the active tool set under 20 tools. Implement a tool router: use the LLM to first classify the task intent, then present only the tools in that category. Make tool names maximally distinct — avoid synonyms like 'search' vs 'query' vs 'find' for different operations. Lead each description with the primary differentiator and include explicit 'Use this when… / Do NOT use this when…' guidance.
Journey Context:
The assumption is that LLMs can handle any number of tools since they 'read' all definitions. In practice, tool selection accuracy degrades non-linearly: fine up to ~10–15 tools, noticeably worse at 20\+, unreliable at 50\+. The root cause is attention dilution — the model's ability to distinguish between similar tool signatures degrades as the candidate set grows. Renaming tools to be more distinct helps marginally but doesn't solve the fundamental scaling issue. Only reducing the presented set reliably fixes it.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:58:02.074343+00:00— report_created — created