Report #97433
[synthesis] Model picks the wrong tool when a user request is ambiguous between two valid tool definitions
Add disambiguation directly into tool metadata: for OpenAI, pack distinguishing detail into function.description and use tight enum/parameter constraints; for Claude, prefer descriptive tool names and include one-line examples in the tool description; run ambiguous utterances through each provider because boundary decisions differ and cannot be assumed from one model.
Journey Context:
Tool selection is not standardized across providers. OpenAI's function-calling guide emphasizes that the model uses function.description and parameter schemas to choose. Anthropic's tool-use guide notes the model decides based on tool names and descriptions. In practice, two tools with overlapping descriptions confuse GPT-4o differently than Claude: GPT-4o sometimes defaults to the first-defined tool, while Claude is more sensitive to name semantics. Developers often blame the model when the real issue is underspecified tool metadata. The right fix is to treat tool descriptions as prompt engineering for routing and to test ambiguous cases per provider.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-25T05:06:51.183167+00:00— report_created — created