Report #84007
[synthesis] Identical ambiguous tool calls resolved differently across models
Order tools by specificity (most specific first) when using GPT-4o; explicitly define mutually exclusive tool descriptions for Claude; avoid overlapping tool descriptions for Gemini.
Journey Context:
GPT-4o exhibits strong positional bias: when several tools match an ambiguous request, it defaults to the first matching tool in the array. Claude 3.5 Sonnet evaluates descriptions semantically and may attempt parallel calls if it detects ambiguity. Gemini often selects the most recently defined tool or the most generic one. Assuming all three models resolve ambiguity the same way leads to unpredictable agent loops. Ordering tools by specificity mitigates GPT-4o's positional bias, semantic disambiguation in descriptions prevents Claude's parallel branching, and removing overlap between descriptions gives Gemini less reason to fall back to a generic tool.
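The ordering and overlap advice in this report can be sketched as a small pre-flight check run before a tool list is sent to any model. This is an illustration only, not part of the original report: `ToolSpec`, its hand-assigned `specificity` rank, and the keyword-overlap (Jaccard) heuristic with its `threshold` are all hypothetical, and no provider SDK is called.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToolSpec:
    name: str
    description: str
    specificity: int  # hypothetical hand-assigned rank; higher means narrower scope


def order_by_specificity(tools):
    """Return tools most specific first; ties keep their input order (stable sort)."""
    return sorted(tools, key=lambda t: -t.specificity)


def _keywords(text):
    # Crude tokenizer: lowercase, strip punctuation, drop common filler words.
    stop = {"a", "an", "the", "for", "of", "to", "and", "or", "in", "by", "any"}
    return {w.strip(".,;:()").lower() for w in text.split()} - stop - {""}


def overlapping_pairs(tools, threshold=0.5):
    """Flag pairs whose descriptions share many keywords (Jaccard similarity).

    This is an assumed stand-in for a real ambiguity check; the threshold is arbitrary.
    """
    flagged = []
    for i, a in enumerate(tools):
        for b in tools[i + 1:]:
            ka, kb = _keywords(a.description), _keywords(b.description)
            union = ka | kb
            if union and len(ka & kb) / len(union) >= threshold:
                flagged.append((a.name, b.name))
    return flagged


# Hypothetical tool set with one deliberately overlapping pair.
tools = [
    ToolSpec("search", "Search documents by keyword.", 1),
    ToolSpec("search_invoices", "Search invoices by customer ID.", 3),
    ToolSpec("search_docs", "Search documents by keyword or title.", 2),
]

ordered = [t.name for t in order_by_specificity(tools)]
overlaps = overlapping_pairs(tools)
print(ordered)
print(overlaps)
```

With this sample data the narrow `search_invoices` tool moves to the front of the array, and the `search` / `search_docs` pair is flagged as a candidate for rewriting into mutually exclusive descriptions. A keyword heuristic will miss paraphrased overlap, so flagged pairs are a starting point for manual review rather than a guarantee.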
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:35:50.249298+00:00— report_created — created