Report #47401
[synthesis] Model selects the wrong tool when tool descriptions are similar or ambiguous
Make tool names and descriptions distinct and mutually exclusive. Claude relies heavily on description semantics, while GPT-4o relies more on tool names, so the same ambiguity leads each model to a different wrong choice.
Journey Context:
When presented with two tools such as search_code and find_file, both described as 'finding things in the repository', Claude 3.5 Sonnet tends to pick based on how closely the nuanced semantics of each description match the user query, while GPT-4o often defaults to the first tool listed or matches keywords in the tool name. When both tools are ambiguous, the models fail differently: Claude picks the semantically closest but wrong tool, and GPT-4o picks the first-defined tool. Disambiguating the descriptions and using highly specific names prevents both failure modes.
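The disambiguation described above can be sketched roughly as follows. This is an unverified illustration, not any vendor's tool schema: the dict shape, the renamed tools (search_file_contents, find_file_by_path), their descriptions, and the 0.6 threshold are all assumptions made for the example. The word-overlap check is only a crude lexical heuristic for spotting near-duplicate descriptions before they reach a model; it does not measure how Claude or GPT-4o actually resolve ambiguity.

```python
import re

# Hypothetical tool definitions mirroring the ambiguous case in this report.
AMBIGUOUS_TOOLS = [
    {"name": "search_code", "description": "Finds things in the repository."},
    {"name": "find_file", "description": "Finds things in the repository."},
]

# Hypothetical rewrite: specific names, and descriptions that state both
# when to use the tool and when not to, so the two are mutually exclusive.
DISTINCT_TOOLS = [
    {
        "name": "search_file_contents",
        "description": (
            "Search the text inside files for a string or regex. "
            "Use when the user asks where a symbol or phrase appears. "
            "Do not use to locate files by name or path."
        ),
    },
    {
        "name": "find_file_by_path",
        "description": (
            "List files whose name or path matches a glob pattern. "
            "Use when the user asks where a file lives. "
            "Does not read file contents."
        ),
    },
]


def _tokens(text):
    """Lowercased set of alphanumeric words in the text."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def description_overlap(a, b):
    """Jaccard similarity of the word sets of two descriptions (0.0 to 1.0)."""
    ta, tb = _tokens(a), _tokens(b)
    if not ta and not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def flag_ambiguous_pairs(tools, threshold=0.6):
    """Return (name_a, name_b, score) for each pair at or above the threshold."""
    flagged = []
    for i, first in enumerate(tools):
        for second in tools[i + 1:]:
            score = description_overlap(
                first["description"], second["description"]
            )
            if score >= threshold:
                flagged.append((first["name"], second["name"], score))
    return flagged


if __name__ == "__main__":
    print(flag_ambiguous_pairs(AMBIGUOUS_TOOLS))
    print(flag_ambiguous_pairs(DISTINCT_TOOLS))
```

A check like this could run in CI over the tool list to catch descriptions that have drifted toward each other. It will not catch descriptions that use different words for the same job, so a manual review of each pair is still worthwhile.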
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:02:42.798219+00:00— report_created — created