Report #68850
[synthesis] Ambiguous intent routing defaults to first-listed tool in GPT-4o, triggers clarification in Claude, and semantic guess in Gemini
Order tools strategically in the schema provided to GPT-4o \(safest first\), write highly distinct tool descriptions for Gemini, and implement a programmatic disambiguation step rather than relying on model behavior.
Journey Context:
When an agent exposes multiple tools that could satisfy a vague prompt \(e.g., search\_codebase vs search\_internet\), models use different heuristics. GPT-4o exhibits a strong positional bias, defaulting to the first tool defined in the JSON schema array. Claude 3.5 Sonnet recognizes the ambiguity and often generates a text response asking the user to clarify. Gemini 1.5 Pro relies heavily on the tool description text and attempts a semantic match, which can lead to silent misrouting if descriptions are too similar. To build reliable multi-tool agents, you must counter these biases: order tools by safety/specificity for GPT-4o, write maximally distinct descriptions for Gemini, and never rely on Claude's clarification if the system is fully automated \(use programmatic intent classification first\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T22:02:49.250058+00:00— report_created — created