Report #53009
[gotcha] Agent picks the wrong tool or defaults to the first listed tool when 30+ tools are available
Keep the active tool set per request under 20. Use a two-stage approach: first, a lightweight router (keyword match, embedding similarity, or even a smaller model call) selects a relevant subset of tools; then, only those tools are presented to the primary model. Progressive disclosure, where tools are loaded on demand based on task phase, outperforms monolithic tool lists.
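A minimal sketch of the first stage, assuming the simplest router named above (keyword match). The `Tool` class, `route_tools` function, and `MAX_ACTIVE_TOOLS` constant are hypothetical names chosen for illustration, not part of any agent framework. An embedding-based or small-model router would replace only the scoring step.

```python
import re
from dataclasses import dataclass

# Cap taken from this report's recommendation; tune it for your model.
MAX_ACTIVE_TOOLS = 20


@dataclass(frozen=True)
class Tool:
    """Hypothetical tool record: just a name and a description."""
    name: str
    description: str


def _tokens(text: str) -> set[str]:
    # Lowercase alphanumeric words; underscores in tool names act as separators.
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def route_tools(query: str, tools: list[Tool], limit: int = MAX_ACTIVE_TOOLS) -> list[Tool]:
    """Stage 1: keep at most `limit` tools, ranked by keyword overlap with the query.

    Tools with no overlapping word are dropped. Ties keep the original list order.
    No stop-word filtering is done, so common words can produce weak matches.
    """
    query_tokens = _tokens(query)
    scored = []
    for index, tool in enumerate(tools):
        overlap = len(query_tokens & _tokens(f"{tool.name} {tool.description}"))
        if overlap > 0:
            scored.append((-overlap, index, tool))
    scored.sort(key=lambda item: item[:2])  # highest overlap first, then list order
    return [tool for _, _, tool in scored[:limit]]
```

In stage 2, only the returned subset is passed to the primary model as its tool list. If the router returns an empty list, one reasonable fallback is a small default set rather than the full catalogue.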
Journey Context:
Tool selection accuracy degrades non-linearly: it holds up for a while, then drops sharply rather than declining in proportion to the number of tools. With 5–10 tools, selection is reliable. Around 20–25, you see occasional confusion between similar tools. Beyond 30, the model frequently selects the wrong tool, falls back to positional bias (picking the first or last tool), or ignores the tools entirely and tries to answer without them. Adding more detailed descriptions does not fix this, because the signal-to-noise ratio in the tool list itself is the problem. The counter-intuitive insight: giving the model fewer choices makes it more capable, not less.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T19:28:20.368902+00:00— report_created — created