Report #23928
[gotcha] Adding more tools non-linearly degrades tool selection accuracy
Keep the number of simultaneously available tools to roughly 15-20 or fewer. Use tool namespacing, categorization meta-tools, or progressive disclosure to limit what the model sees at once. Measure selection accuracy as you add tools: it drops off a cliff, not a slope.
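One way to measure selection accuracy as the tool list grows is sketched below. It is a minimal harness under stated assumptions, not a benchmark: the `Selector` signature, the function names, and `keyword_selector` are all hypothetical, and `keyword_selector` is a deterministic placeholder standing in for a real model call. The placeholder will not show any degradation; the shape of the curve only appears once you plug in your actual model.

```python
from typing import Callable, Sequence

# Hypothetical interface: (prompt, visible tool names) -> chosen tool name.
Selector = Callable[[str, Sequence[str]], str]
Case = tuple[str, str]  # (prompt, expected tool name)


def selection_accuracy(select: Selector, cases: Sequence[Case],
                       tools: Sequence[str]) -> float:
    """Fraction of cases where the selector picked the expected tool."""
    if not cases:
        raise ValueError("need at least one labeled case")
    hits = sum(1 for prompt, expected in cases if select(prompt, tools) == expected)
    return hits / len(cases)


def accuracy_by_tool_count(select: Selector, cases: Sequence[Case],
                           core_tools: Sequence[str],
                           distractors: Sequence[str],
                           distractor_counts: Sequence[int]) -> dict[int, float]:
    """Re-run the same cases while padding the list with more distractor tools."""
    results: dict[int, float] = {}
    for n in distractor_counts:
        visible = list(core_tools) + list(distractors[:n])
        results[len(visible)] = selection_accuracy(select, cases, visible)
    return results


def keyword_selector(prompt: str, tools: Sequence[str]) -> str:
    """Placeholder, NOT a model: first tool whose name occurs in the prompt."""
    lowered = prompt.lower()
    for name in tools:
        if name in lowered:
            return name
    return tools[0]


# Illustrative labeled cases and tool names (all made up for this sketch).
cases = [("refund order 12", "refund"), ("search the docs", "search")]
core_tools = ["refund", "search"]
distractors = [f"distractor_{i}" for i in range(48)]

# Maps total visible tool count (2, 20, 50) to accuracy on the same cases.
curve = accuracy_by_tool_count(keyword_selector, cases, core_tools,
                               distractors, [0, 18, 48])
```

Holding the labeled cases fixed and varying only the number of distractors isolates the effect of list size, which makes a threshold easier to spot than comparing unrelated test runs.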
Journey Context:
It feels logical that more tools = more capability. In practice, more tools = more confusion. Tool selection is an attention problem: the model must identify the right tool from a list, and attention dilutes as the list grows. With 5 tools, selection is near-perfect. With 20, it's manageable. With 50+, the model starts calling the wrong tool, calling tools that don't exist (hallucinating names from fragments of real ones), or falling back to a 'default' tool that's wrong. The degradation is non-linear: there's a threshold where accuracy collapses. Teams hit this and think 'the model is bad at tool use,' but the model is actually bad at searching a 50-item list. Progressive disclosure (load only the tools relevant to the current task) is the proven fix. You still have 50 tools; you just never show all 50 at once.
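Progressive disclosure combined with namespacing could look roughly like the sketch below. Everything here is an assumption made for illustration: the `Tool` and `ToolRegistry` names, the `namespace.action` naming scheme, and the default cap of 15 are hypothetical and not tied to any particular framework. How the active namespaces are chosen (a router step, a meta-tool call, task metadata) is left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tool:
    name: str         # namespaced, e.g. "billing.action_3" (hypothetical scheme)
    description: str

    @property
    def namespace(self) -> str:
        return self.name.split(".", 1)[0]


class ToolRegistry:
    """Holds every tool, but only exposes the ones in the active namespaces."""

    def __init__(self, max_visible: int = 15) -> None:
        self.max_visible = max_visible
        self._tools: dict[str, Tool] = {}

    def register(self, tool: Tool) -> None:
        self._tools[tool.name] = tool

    def namespaces(self) -> list[str]:
        """Short category list; could back a 'pick a category' meta-tool."""
        return sorted({t.namespace for t in self._tools.values()})

    def visible_tools(self, active: set[str]) -> list[Tool]:
        """Tools to show the model for the current task, capped at max_visible."""
        selected = sorted(
            (t for t in self._tools.values() if t.namespace in active),
            key=lambda t: t.name,
        )
        if len(selected) > self.max_visible:
            # Fail loudly instead of silently showing an oversized list.
            raise ValueError(
                f"{len(selected)} tools selected, cap is {self.max_visible}"
            )
        return selected


# 50 tools in total: 5 namespaces with 10 tools each (names are made up).
registry = ToolRegistry(max_visible=15)
for ns in ["billing", "calendar", "email", "files", "search"]:
    for i in range(10):
        registry.register(Tool(f"{ns}.action_{i}", f"{ns} action {i}"))

# Only the 10 billing tools are exposed for a billing task.
billing_tools = registry.visible_tools({"billing"})
```

Raising when the cap is exceeded is a design choice in this sketch; truncating or ranking by relevance would also work, but an explicit error makes it obvious when a task pulls in too many namespaces at once.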
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T18:34:24.329905+00:00— report_created — created