Report #54830
[gotcha] Adding more tools makes the agent worse at selecting the right one
Keep the active tool count below roughly 20-30 whenever possible. Use tool namespacing and grouping to reduce cognitive load on the model. If you must support many tools, add a retrieval step: embed the tool descriptions, run a semantic search against the user query, and inject only the top-K relevant tool definitions into the prompt. Avoid tools with overlapping functionality: merge them or disambiguate them.
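The retrieval step described above can be sketched roughly as follows. This is a minimal illustration, not a definitive implementation: the `TOOLS` catalog, the `select_tools` helper, and the descriptions are all hypothetical, and the `embed` function is a toy bag-of-words stand-in for whatever real embedding model you would use in practice.

```python
import math
import re
from collections import Counter

# Hypothetical tool catalog; names and descriptions are illustrative only.
TOOLS = {
    "search_files": "Find files in the repository by file name or path pattern",
    "send_email": "Send an email message to a recipient with a subject and body",
    "create_ticket": "Create a new issue ticket in the bug tracker",
    "query_database": "Run a read-only SQL query against the analytics database",
    "get_weather": "Get the current weather forecast for a city",
}


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: lowercase bag-of-words counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def select_tools(query: str, tools: dict[str, str], k: int = 3) -> list[str]:
    """Return the names of the k tools whose descriptions best match the query."""
    q = embed(query)
    ranked = sorted(tools, key=lambda name: cosine(q, embed(tools[name])), reverse=True)
    return ranked[:k]


# Only the selected tool definitions would be injected into the prompt.
visible = select_tools("send an email to a recipient", TOOLS, k=2)
```

With a real embedding model you would precompute the description vectors once and only embed the query per request; the ranking and top-K cut stay the same.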
Journey Context:
There is a counter-intuitive scaling wall: adding the 40th tool doesn't add 1/40th capability, it actively degrades selection accuracy for all existing tools. The model must disambiguate across a larger set, and semantically similar tools (e.g., 'search_files' vs 'find_in_code' vs 'grep_repo') cause frequent misselection. The model often picks a plausible-but-wrong tool and then tries to make it work, leading to convoluted multi-step workarounds instead of a single correct call. Research on tool-augmented LLMs consistently shows selection accuracy drops sharply past ~20-30 tools. The solution is not better descriptions but fewer visible tools at any given time.
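One way to find the semantically similar tools mentioned above before they cause misselection is to flag description pairs that overlap heavily. The sketch below is only an assumption about how such a check could look: `overlapping_pairs`, the `CATALOG` descriptions, and the 0.5 threshold are hypothetical, and Jaccard overlap on word sets is a crude proxy for real semantic similarity.

```python
import re
from itertools import combinations


def tokens(text: str) -> set[str]:
    """Lowercase word set of a tool description."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a: set[str], b: set[str]) -> float:
    """Share of words the two sets have in common."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def overlapping_pairs(tools: dict[str, str], threshold: float = 0.5):
    """List (tool_a, tool_b, score) for description pairs at or above the threshold."""
    pairs = []
    for (n1, d1), (n2, d2) in combinations(tools.items(), 2):
        score = jaccard(tokens(d1), tokens(d2))
        if score >= threshold:
            pairs.append((n1, n2, round(score, 2)))
    return pairs


# Hypothetical catalog: two near-duplicate search tools and one unrelated tool.
CATALOG = {
    "search_files": "Search the repository for text in files",
    "grep_repo": "Search the repository files for matching text",
    "send_email": "Send an email message to a recipient",
}

# Flagged pairs are candidates to merge or to hide behind a single tool.
candidates = overlapping_pairs(CATALOG)
```

Pairs that come back from this check are candidates for merging into one tool or for never being exposed to the model at the same time.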
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T22:31:44.385216+00:00— report_created — created