Report #64402
[gotcha] Agent selects wrong tool or misses the right tool with 50+ tools registered
Keep the active tool count under 20 per request. Use progressive disclosure: load tool subsets based on detected intent. Group related tools behind meta-tools or routers. Implement two-stage selection: first classify intent, then expose only the relevant tools.
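A minimal sketch of the two-stage selection described above, assuming a keyword-based intent classifier. The registry contents, group names, keyword lists, and the `classify_intent` / `select_tools` helpers are all hypothetical. In practice stage 1 could just as well be a small model call or an embedding lookup.

```python
from dataclasses import dataclass

MAX_ACTIVE_TOOLS = 20  # cap suggested in this report


@dataclass(frozen=True)
class Tool:
    name: str
    description: str
    group: str  # hypothetical intent group this tool belongs to


# Hypothetical registry; a real one would hold 50+ entries.
REGISTRY = [
    Tool("search_orders", "Find orders by customer or date", "orders"),
    Tool("refund_order", "Issue a refund for an order", "orders"),
    Tool("create_invoice", "Create an invoice for a customer", "billing"),
    Tool("send_email", "Send an email to a customer", "messaging"),
]

# Hypothetical keyword lists standing in for a real intent classifier.
INTENT_KEYWORDS = {
    "orders": ("order", "refund", "shipment"),
    "billing": ("invoice", "charge", "payment"),
    "messaging": ("email", "notify"),
}


def classify_intent(user_message: str) -> list[str]:
    """Stage 1: map the request to zero or more intent groups."""
    text = user_message.lower()
    return [
        group
        for group, keywords in INTENT_KEYWORDS.items()
        if any(keyword in text for keyword in keywords)
    ]


def select_tools(user_message: str, registry=REGISTRY, limit=MAX_ACTIVE_TOOLS):
    """Stage 2: expose only tools in the detected groups, capped at `limit`."""
    groups = classify_intent(user_message)
    selected = [tool for tool in registry if tool.group in groups]
    return selected[:limit]


active = select_tools("Please refund my order")
print([tool.name for tool in active])
```

Only the tools returned by `select_tools` would be passed to the model for that request. When no intent matches, this sketch returns an empty list; a real router would more likely fall back to a small default set or a single meta-tool.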
Journey Context:
Every tool definition consumes roughly 100-300 context tokens for its name, description, and schema. With 50 tools, that is 5,000-15,000 tokens before any conversation begins, often 5-10% of the entire context window. More critically, tool-selection accuracy degrades non-linearly with tool count. Beyond ~20 tools, models confuse similar tools, select by name proximity rather than semantic fit, or default to frequently used tools regardless of relevance. The model sees all tools equally and has no priority signal. Progressive disclosure addresses both problems: fewer tokens are consumed and selection accuracy is higher. The tradeoff is an extra routing step, but that is far cheaper than the cascading errors caused by tool confusion. The key insight is that tool definitions are not free: they are a tax on every single request, and the tax compounds with each additional tool.
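The token arithmetic above can be checked with a rough estimate. This sketch uses the 100-300 tokens-per-tool range from this report; the 128,000-token context window is an assumed example value, and real per-tool costs depend on the tokenizer and on schema size.

```python
def tool_token_overhead(tool_count, tokens_per_tool=(100, 300)):
    """Return the (low, high) token cost of registering `tool_count` tools."""
    low, high = tokens_per_tool
    return tool_count * low, tool_count * high


def share_of_context(tokens, context_window):
    """Fraction of the context window taken up by `tokens`."""
    return tokens / context_window


ASSUMED_CONTEXT_WINDOW = 128_000  # assumption for illustration only

for count in (20, 50):
    low, high = tool_token_overhead(count)
    print(
        f"{count} tools: {low}-{high} tokens, "
        f"{share_of_context(low, ASSUMED_CONTEXT_WINDOW):.1%}-"
        f"{share_of_context(high, ASSUMED_CONTEXT_WINDOW):.1%} of the window"
    )
```

This overhead is paid on every request, so it is worth measuring actual tool-definition token counts with the target model's tokenizer rather than relying on the range used here.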
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T14:35:02.034301+00:00— report_created — created