Report #40638
[synthesis] Model selects the wrong tool or fails to select any tool when user intent matches multiple tool descriptions
For Claude, make tool descriptions mutually exclusive and highly specific; for GPT-4o, provide explicit routing logic in the system prompt; for Gemini, simplify tool schemas.
Journey Context:
When several tools overlap in functionality, the three models fail in different ways. Claude 3.5 tends to hallucinate a hybrid tool call that combines parameters from both tools. GPT-4o tends to pick the first tool listed or to ask for clarification. Gemini 1.5 often selects no tool at all. Claude's helpfulness bias leads it to try to satisfy the request by merging tools, which produces invalid tool calls. Making tool descriptions strictly mutually exclusive prevents Claude's hybrid hallucinations.
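A minimal sketch of what this could look like in practice, assuming two hypothetical tools (`search_orders` and `cancel_order`) and a hypothetical helper `validate_tool_call`; none of these names come from the report or from any vendor SDK. Each description states when the tool applies and when it does not, and the validator flags a call whose arguments include a parameter owned by a different tool, which is the hybrid pattern described above.

```python
# Hypothetical tool registry: names, descriptions, and fields are illustrative only.
TOOLS = {
    "search_orders": {
        "description": (
            "Look up existing orders by customer email. "
            "Use only for reading order history. "
            "Do not use to change or cancel an order."
        ),
        "parameters": {"customer_email"},
        "required": {"customer_email"},
    },
    "cancel_order": {
        "description": (
            "Cancel one order by its order ID. "
            "Use only when the user asks to cancel. "
            "Do not use to look up orders."
        ),
        "parameters": {"order_id", "reason"},
        "required": {"order_id"},
    },
}


def validate_tool_call(name, arguments, tools=TOOLS):
    """Return a list of problems with a proposed tool call (empty if valid)."""
    if name not in tools:
        return [f"unknown tool: {name}"]

    spec = tools[name]
    problems = []
    extra = set(arguments) - spec["parameters"]
    missing = spec["required"] - set(arguments)

    for param in sorted(extra):
        # A stray parameter that another tool declares suggests a hybrid call.
        owners = sorted(
            other for other, other_spec in tools.items()
            if other != name and param in other_spec["parameters"]
        )
        if owners:
            problems.append(f"parameter '{param}' belongs to {owners}, not {name}")
        else:
            problems.append(f"unexpected parameter: {param}")

    for param in sorted(missing):
        problems.append(f"missing required parameter: {param}")

    return problems
```

Running a check like this on the model's proposed call before executing it would catch a merged call (for example, `cancel_order` invoked with `customer_email`) and allow the request to be retried or rejected. It does not replace tightening the descriptions themselves.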
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T22:41:02.973969+00:00— report_created — created