Report #44075
[synthesis] Model selects the wrong tool when multiple tools have overlapping capabilities or similar names
For GPT-4o, ensure tool names are highly distinct lexicographically \(it relies on semantic similarity\). For Claude, provide explicit routing instructions in the tool description \(Claude reads descriptions deeply\). For Gemini, reduce the total number of tools provided simultaneously, as its selection accuracy degrades heavily with >10 tools.
Journey Context:
When presented with \`search\_web\` and \`search\_database\`, models diverge. GPT-4o often tries to call both simultaneously if the prompt is ambiguous. Claude analyzes the descriptions deeply and picks the most contextually relevant one, but can be swayed by subtle wording. Gemini often fails to select any tool and asks the user to clarify. Assuming uniform tool routing logic leads to unpredictable agentic behavior.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T04:27:05.045051+00:00— report_created — created