Report #65879
[synthesis] Model selects the wrong tool from a large toolset because it matches on keyword similarity rather than functional capability
Group tools by capability in each tool definition's description (e.g., "[Database] Tool for..."). For GPT-4o, use the `tool_choice` parameter to constrain which tools the model may call, or pass only the relevant subset of tools in the request, if possible. For Claude, explicitly state the tool selection logic in the system prompt.
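A minimal sketch of the tagging and subsetting idea, not a verified implementation. The tool names, categories, and the helpers `tag_tool`, `tools_for`, and `selection_rules` are hypothetical. The dictionary shape assumes the function-style tool definition used for GPT-4o function calling; other providers use different field names, so adapt as needed. No API call is made here.

```python
from collections import defaultdict

# Hypothetical tool inventory; names and categories are illustrative only.
RAW_TOOLS = [
    {"name": "query_orders", "category": "Database",
     "description": "Run a read-only SQL query against the orders table."},
    {"name": "send_email", "category": "Messaging",
     "description": "Send an email to a customer."},
    {"name": "search_docs", "category": "Search",
     "description": "Full-text search over product documentation."},
    {"name": "update_order", "category": "Database",
     "description": "Update the status of an existing order."},
]


def tag_tool(tool):
    """Build a tool definition whose description starts with a [Category] tag."""
    return {
        "type": "function",
        "function": {
            "name": tool["name"],
            "description": f"[{tool['category']}] {tool['description']}",
            # Parameter schemas are left empty to keep the sketch short.
            "parameters": {"type": "object", "properties": {}},
        },
    }


def tools_for(categories):
    """Return tagged definitions for the given categories only."""
    wanted = set(categories)
    return [tag_tool(t) for t in RAW_TOOLS if t["category"] in wanted]


def selection_rules():
    """Render the category-to-tool mapping as text for a system prompt."""
    by_category = defaultdict(list)
    for t in RAW_TOOLS:
        by_category[t["category"]].append(t["name"])
    lines = ["Pick the category that matches the task first, then a tool within it:"]
    for category in sorted(by_category):
        lines.append(f"- [{category}]: {', '.join(by_category[category])}")
    return "\n".join(lines)
```

The list returned by `tools_for(["Database"])` could be passed as the request's tool list so the model only sees database tools, while the text from `selection_rules()` could be placed in the system prompt to spell out the selection logic.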
Journey Context:
When presented with 20+ tools, models tend to fall back on keyword matching instead of evaluating what each tool actually does. GPT-4o often picks a tool whose description shares a word with the user prompt, even if it is functionally wrong. Claude tends to chain tool calls to gather more information first. Adding categorical tags to descriptions acts as a routing mechanism, reducing the search space the model has to evaluate.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T17:03:30.922282+00:00— report_created — created