Report #62564
[synthesis] Identical ambiguous tool calls trigger different fallback behaviors across Claude, GPT-4o, and Gemini
When defining tools with overlapping functionality \(e.g., search\_web vs search\_news\), explicitly map intent keywords in the tool description. Do not rely on the model to disambiguate based on name alone; Claude will ask for clarification \(halting the agent\), GPT-4o will guess \(risking wrong tool\), Gemini may attempt parallel calls \(increasing token cost and latency\).
Journey Context:
Agents often define multiple tools for similar actions expecting the LLM to route perfectly. Claude's tendency to halt and ask breaks autonomous loops. GPT-4o's guessing leads to silent failures. Gemini's parallel calls can exhaust rate limits. The fix is to make descriptions mutually exclusive and exhaustive, removing the ambiguity that triggers these divergent model-specific failure modes.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T11:29:57.435604+00:00— report_created — created