Report #86040
[synthesis] Agent loops or selects wrong tool when multiple tools have overlapping functionality
Disambiguate tool names and descriptions aggressively. If using Claude, expect it to ask the user for clarification, so make sure your agent loop accepts text-only responses (turns with no tool call). If using GPT-4o, expect it to guess; add disambiguating keywords to the tool descriptions to steer that guess.
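A minimal sketch of a loop step that tolerates text-only responses, so a clarifying question reaches the user instead of being treated as a failure. `ModelTurn` and `handle_turn` are hypothetical names for illustration, not part of any provider SDK; a real integration would map the provider's response object onto something like this.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class ModelTurn:
    """Hypothetical, provider-neutral view of one model response."""
    text: Optional[str] = None        # plain-text reply, e.g. a clarifying question
    tool_name: Optional[str] = None   # set only when the model requested a tool
    tool_args: Optional[dict] = None  # keyword arguments for the requested tool


def handle_turn(turn: ModelTurn, tools: dict[str, Callable[..., str]]) -> tuple[str, str]:
    """Return (kind, payload); kind is 'tool_result', 'ask_user', or 'error'."""
    if turn.tool_name is not None:
        fn = tools.get(turn.tool_name)
        if fn is None:
            # The model named a tool we never registered.
            return ("error", f"unknown tool: {turn.tool_name}")
        return ("tool_result", fn(**(turn.tool_args or {})))
    # No tool call: pass the text to the user rather than re-prompting.
    return ("ask_user", turn.text or "")
```

The caller can then branch on the returned kind: feed a `tool_result` back to the model, and show an `ask_user` payload to the user and wait for their answer.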
Journey Context:
When faced with `search_files` vs `search_code`, Claude 3.5 Sonnet often stalls and asks the user 'Which one?'. GPT-4o tends to pick one based on subtle description cues or order. This means Claude agents feel slower but are safer, while GPT agents are faster but prone to silent misrouting.
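One way to disambiguate the two tools is to have each description state what it does, what it does not do, and which sibling to use instead. The sketch below assumes `search_files` matches on file names and `search_code` matches on file contents; the report does not define either tool, so treat those semantics, the dictionary shape, and the `missing_cross_references` helper as hypothetical.

```python
# Assumed semantics: search_files matches names/paths, search_code matches contents.
TOOLS = [
    {
        "name": "search_files",
        "description": (
            "Find files by NAME or PATH pattern (for example '*.yaml'). "
            "Does NOT read file contents. "
            "To look inside files, use search_code instead."
        ),
    },
    {
        "name": "search_code",
        "description": (
            "Search INSIDE file contents for text or symbols. "
            "Does NOT match on file names. "
            "To locate files by name, use search_files instead."
        ),
    },
]


def missing_cross_references(tools: list) -> list:
    """Return names of tools whose description omits at least one sibling tool."""
    names = [tool["name"] for tool in tools]
    flagged = []
    for tool in tools:
        siblings = [name for name in names if name != tool["name"]]
        if any(sibling not in tool["description"] for sibling in siblings):
            flagged.append(tool["name"])
    return flagged
```

Running `missing_cross_references(TOOLS)` as a lint step before deployment flags any description that does not point at its overlapping sibling. It only checks for the sibling's name, so it cannot tell whether the wording is actually clear.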
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T03:00:14.349247+00:00— report_created — created