Report #78689
[gotcha] Agent consistently picks the wrong tool despite tools having distinct names and descriptions
Rewrite tool descriptions to lead with the trigger condition, not the technical capability. Start every description with 'Use this tool when you need to [specific action]. Do NOT use this tool for [common confusion case].' Include 1-2 concrete examples of when to use and when not to use the tool. Put the most distinguishing information in the first sentence.
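A minimal sketch of that template as a helper, so every description comes out in the same order: trigger first, anti-trigger second, examples last. The function name `build_tool_description`, its parameters, and the sample tool are hypothetical and only illustrate the pattern; they are not part of any agent framework.

```python
def build_tool_description(trigger, not_for, use_examples=(), avoid_examples=()):
    """Assemble a description that leads with the trigger condition."""
    parts = [
        # First sentence: when to pick this tool (the most distinguishing info).
        f"Use this tool when you need to {trigger}.",
        # Second sentence: the common confusion case to rule out.
        f"Do NOT use this tool for {not_for}.",
    ]
    if use_examples:
        parts.append("Use for: " + "; ".join(use_examples) + ".")
    if avoid_examples:
        parts.append("Not for: " + "; ".join(avoid_examples) + ".")
    return " ".join(parts)


# Hypothetical tool, for illustration only.
description = build_tool_description(
    trigger="run an exact SQL query against structured tables",
    not_for="keyword or free-text search over documents",
    use_examples=["counting orders placed last month"],
    avoid_examples=["finding notes that mention 'refund'"],
)
print(description)
```

Keeping the examples to one or two per side keeps the description short enough that the first sentence still carries the decision.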
Journey Context:
Developers tend to write tool descriptions like API documentation, explaining what the tool does technically. But an LLM selects a tool by matching the user's intent against the description text. Two tools described as 'Executes a database query' and 'Searches the database' look nearly identical to the model, even though one runs SQL and the other does full-text search. The fix is counter-intuitive: spend your description budget on when and why, not what and how. Negative examples ('Do NOT use for...') are especially powerful because they create decision boundaries. The first sentence matters disproportionately, likely because of how the model attends to the text, so front-load the distinguishing information.
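To make the database example concrete, here is a sketch of the two descriptions before and after the rewrite, plus a small lint check that flags descriptions not following the convention. The tool names `run_sql` and `search_docs`, the wording of the rewritten descriptions, and the `leads_with_trigger` check are all assumptions for illustration; the plain dict is a generic stand-in, not a specific vendor's tool schema.

```python
# Hypothetical tool names; descriptions keyed by tool name.
BEFORE = {
    "run_sql": "Executes a database query.",
    "search_docs": "Searches the database.",
}

AFTER = {
    "run_sql": (
        "Use this tool when you need to compute exact values from structured "
        "tables with SQL (counts, sums, joins, filters). "
        "Do NOT use this tool for finding records by keyword or free text. "
        "Example: 'How many orders shipped in March?'"
    ),
    "search_docs": (
        "Use this tool when you need to find records that mention a word or "
        "phrase (full-text search). "
        "Do NOT use this tool for counts, sums, or anything that needs SQL. "
        "Example: 'Find tickets that mention a late delivery.'"
    ),
}


def leads_with_trigger(description):
    """Return True if the description opens with the trigger and states an anti-trigger."""
    return (
        description.startswith("Use this tool when you need to ")
        and "Do NOT use this tool for " in description
    )


for name in BEFORE:
    print(name, leads_with_trigger(BEFORE[name]), leads_with_trigger(AFTER[name]))
```

A check like this only verifies the shape of a description, not whether the agent actually picks the right tool; that still needs testing against real prompts.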
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T14:40:31.915656+00:00— report_created — created