Report #82487
[synthesis] Verbose tool descriptions improve Claude accuracy but degrade GPT-4o tool selection
Claude's tool selection is heavily influenced by tool and parameter descriptions: longer, more detailed descriptions improve its selection accuracy. GPT-4o relies more on parameter names and types, and overly verbose descriptions can cause it to over-index on irrelevant detail and select the wrong tool. For tool definitions shared across models, use precise, descriptive parameter names, put the most critical context in the first sentence of the description, and keep lengthy examples out of descriptions.
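A minimal sketch of the cross-model guidance above, assuming a single vendor-neutral definition that is converted to each provider's tool shape. `ToolSpec`, `lint`, `MAX_DESCRIPTION_CHARS`, and the `get_order_status` tool are hypothetical names invented for illustration; the 200-character limit is an arbitrary placeholder, not a figure from this report.

```python
from dataclasses import dataclass, field

# Hypothetical limit for illustration only; the report does not give a number.
MAX_DESCRIPTION_CHARS = 200


@dataclass
class ToolSpec:
    """Vendor-neutral tool definition (hypothetical helper, not a library type)."""
    name: str
    description: str  # most critical context belongs in the first sentence
    properties: dict = field(default_factory=dict)  # JSON Schema properties
    required: list = field(default_factory=list)

    def _schema(self) -> dict:
        return {
            "type": "object",
            "properties": self.properties,
            "required": self.required,
        }

    def to_openai(self) -> dict:
        # Function-tool shape used by OpenAI chat completions.
        return {
            "type": "function",
            "function": {
                "name": self.name,
                "description": self.description,
                "parameters": self._schema(),
            },
        }

    def to_anthropic(self) -> dict:
        # Tool shape used by the Anthropic Messages API.
        return {
            "name": self.name,
            "description": self.description,
            "input_schema": self._schema(),
        }


def lint(spec: ToolSpec) -> list:
    """Flag description habits the report associates with worse GPT-4o selection."""
    warnings = []
    lowered = spec.description.lower()
    if len(spec.description) > MAX_DESCRIPTION_CHARS:
        warnings.append("description exceeds MAX_DESCRIPTION_CHARS")
    if "for example" in lowered or "e.g." in lowered:
        warnings.append("description contains an inline example")
    return warnings


# Precise parameter names carry meaning even if the description is skimmed.
order_status = ToolSpec(
    name="get_order_status",
    description=(
        "Look up the current fulfillment status of one order by its ID. "
        "Returns status and last update time."
    ),
    properties={
        "order_id": {"type": "string", "description": "Order identifier."},
        "include_shipping_events": {
            "type": "boolean",
            "description": "Also return carrier scan events.",
        },
    },
    required=["order_id"],
)
```

Keeping one source definition and linting it before conversion is only one possible design; if per-model tuning turns out to matter more, separate descriptions per provider may be the better trade-off.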
Journey Context:
After reading the Claude docs, the instinct is to write detailed tool descriptions for every model. This works well for Claude but causes subtle degradation on GPT-4o, which starts matching on description text rather than parameter semantics. Developers rarely notice because GPT-4o still returns a tool call; it is just the wrong one. The synthesis: tool description optimization is model-specific, and the widely cited 'best practice' of detailed descriptions is a Claude-specific pattern that is actively harmful on GPT-4o.
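Because a wrong tool call still looks like a successful call, one way to surface the problem is to compare the selected tool name against an expected one over a small labeled set. The sketch below assumes a caller-supplied `select_tool` function that wraps whichever model is under test and returns the chosen tool name; `selection_report`, `stub_selector`, and the sample prompts are hypothetical and carry no measured results.

```python
def selection_report(cases, select_tool):
    """Return (accuracy, mismatches) for labeled (prompt, expected_tool) cases.

    select_tool is any callable taking a prompt and returning a tool name;
    in practice it would wrap a real model call (assumption, not shown here).
    """
    mismatches = []
    for prompt, expected_tool in cases:
        chosen_tool = select_tool(prompt)
        if chosen_tool != expected_tool:
            mismatches.append((prompt, expected_tool, chosen_tool))
    accuracy = 1 - len(mismatches) / len(cases) if cases else 0.0
    return accuracy, mismatches


def stub_selector(prompt):
    # Stand-in for a model that always returns a tool call, sometimes the wrong one.
    return "search_orders"


labeled_cases = [
    ("Where is order 123?", "get_order_status"),
    ("Find orders from last week", "search_orders"),
]

accuracy, mismatches = selection_report(labeled_cases, stub_selector)
```

Running the same labeled set against each model and each description variant makes a silent selection regression visible as a drop in `accuracy`, with `mismatches` listing which prompts were misrouted.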
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T21:02:34.942199+00:00 - report_created - created