Report #11965
[gotcha] Similar tool descriptions cause systematic wrong-tool selection — model picks 'search' when it needs 'query'
Write tool descriptions that start with WHEN to use the tool, not WHAT it does. Include negative examples: 'Use this for X. Do NOT use this for Y; use other_tool instead.' Make the first sentence of each description as distinct as possible from the first sentence of every other tool. Test with the model: if it picks the wrong tool, treat the description as the bug.
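A minimal sketch of what this can look like, assuming a generic JSON-schema-style tool list. The tool names find_symbol and search_text, the parameter names, and the lint rules are all hypothetical and only illustrate the pattern; they are not tied to any particular model provider's API.

```python
# Hypothetical tool definitions: names, parameters, and layout are illustrative.
TOOLS = [
    {
        "name": "find_symbol",
        "description": (
            "Use this when you need to find where a function, class, or "
            "variable is defined, given its name. "
            "Do NOT use this for full-text content search; use search_text instead."
        ),
        "parameters": {
            "type": "object",
            "properties": {"symbol_name": {"type": "string"}},
            "required": ["symbol_name"],
        },
    },
    {
        "name": "search_text",
        "description": (
            "Use this when you need to locate a literal string or regex "
            "anywhere in file contents. "
            "Do NOT use this to look up a definition by name; use find_symbol instead."
        ),
        "parameters": {
            "type": "object",
            "properties": {"pattern": {"type": "string"}},
            "required": ["pattern"],
        },
    },
]


def lint_description(tool, all_names):
    """Return a list of problems found in one tool description."""
    problems = []
    desc = tool["description"]
    # Rule 1 (assumed convention): lead with the selection criterion.
    if not desc.lower().startswith("use this when"):
        problems.append("does not lead with a selection criterion")
    # Rule 2: a negative example must exist and name an alternative tool.
    others = [n for n in all_names if n != tool["name"]]
    if "do not use" not in desc.lower():
        problems.append("no negative example")
    elif not any(n in desc for n in others):
        problems.append("negative example names no alternative tool")
    return problems


def lint_tools(tools):
    """Map each tool name to its list of description problems."""
    names = [t["name"] for t in tools]
    return {t["name"]: lint_description(t, names) for t in tools}
```

Running lint_tools(TOOLS) returns an empty problem list for both tools, while a WHAT-style description such as 'Searches the codebase for patterns' fails both rules. A string check like this only catches missing structure; it does not replace testing tool selection against the model itself.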
Journey Context:
Tool descriptions are the model's main signal for tool selection. Most developers write descriptions explaining what the tool does ('Searches the codebase for patterns'), but several tools may match that description equally well. The model needs to know when to choose THIS tool over the alternatives. Leading with the decision criterion ('Use this when you need to find symbol definitions by name. For full-text content search, use search_text instead.') gives the model a clear selection rule. Negative examples are surprisingly effective, plausibly because they make the contrast between tools explicit. In our experience this is one of the highest-ROI changes for improving tool selection accuracy, yet it is rarely done because developers assume the model 'should just know' from the tool name alone.
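One rough way to spot descriptions that match each other too closely is to compare the word overlap of their first sentences. The sketch below uses Jaccard similarity with an arbitrary 0.5 threshold; the function names, the threshold, and the sample descriptions are assumptions for illustration, and word overlap is only a crude proxy for how a model actually reads the text.

```python
import re
from itertools import combinations


def first_sentence(description):
    """Return the text up to the first sentence-ending punctuation mark."""
    return re.split(r"(?<=[.!?])\s+", description.strip(), maxsplit=1)[0]


def word_set(text):
    """Lowercased set of word-like tokens (letters, digits, underscores)."""
    return set(re.findall(r"[a-z0-9_]+", text.lower()))


def jaccard(a, b):
    """Size of the intersection divided by size of the union."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def overlapping_pairs(descriptions, threshold=0.5):
    """Return (name_a, name_b, score) for tools whose first sentences overlap.

    `descriptions` maps tool name -> description text. The threshold is an
    arbitrary starting point, not a validated cutoff.
    """
    firsts = {n: word_set(first_sentence(d)) for n, d in descriptions.items()}
    flagged = []
    for a, b in combinations(sorted(firsts), 2):
        score = jaccard(firsts[a], firsts[b])
        if score >= threshold:
            flagged.append((a, b, round(score, 2)))
    return flagged


# WHAT-style descriptions: nearly identical first sentences.
vague = {
    "search": "Searches the codebase for patterns.",
    "query": "Queries the codebase for patterns.",
}

# WHEN-style descriptions: the decision criterion differs per tool.
distinct = {
    "find_symbol": (
        "Use this when you need to find symbol definitions by name. "
        "For full-text content search, use search_text instead."
    ),
    "search_text": (
        "Use this when you need to match a literal string or regex inside "
        "file contents. To look up a definition by name, use find_symbol instead."
    ),
}

print(overlapping_pairs(vague))     # [('query', 'search', 0.67)]
print(overlapping_pairs(distinct))  # []
```

Note that a shared 'Use this when you need to' prefix raises the overlap score for every pair, so the distinguishing words need to come right after it. A flagged pair is a prompt to rewrite and then re-test with the model, not proof that selection will fail.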
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T14:46:16.536072+00:00— report_created — created