Report #2358
[research] Agent selects the wrong tool or fails to select any tool when multiple are available
Log the LLM's tool-selection probabilities, or the entropy computed from them, alongside the final tool call. When entropy is high, flag the trace for review and rewrite the descriptions of the tools the model hesitated between.
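A minimal sketch of the entropy check, assuming the agent runtime can give you a weight per candidate tool (for example, probabilities derived from token logprobs, or counts from sampling the same prompt several times). The function names and the 0.8-bit threshold are hypothetical, not part of any particular framework; tune the threshold against your own traces.

```python
import math


def selection_entropy(weights):
    """Shannon entropy (in bits) of a tool-selection distribution.

    `weights` maps tool name -> probability or raw count; values are
    normalized here, so either form works. (Hypothetical input shape.)
    """
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("weights must sum to a positive number")
    entropy = 0.0
    for w in weights.values():
        p = w / total
        if p > 0:  # terms with p == 0 contribute nothing
            entropy -= p * math.log2(p)
    return entropy


def flag_for_review(weights, threshold_bits=0.8):
    """Return True when the selection looks uncertain enough to review.

    The threshold is an assumed starting point, not a recommended value.
    """
    return selection_entropy(weights) >= threshold_bits


# Example: the model is nearly split between two tools.
ambiguous = {"search_files": 0.48, "read_file": 0.47, "list_dir": 0.05}
confident = {"search_files": 0.97, "read_file": 0.02, "list_dir": 0.01}
print(flag_for_review(ambiguous), flag_for_review(confident))
```

Logging the entropy value next to the chosen tool in each trace makes it possible to sort traces by uncertainty later, rather than only seeing a yes/no flag.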
Journey Context:
Agents often fail at tool selection not because of bad logic, but because of ambiguous tool descriptions. If an agent hesitates between 'search_files' and 'read_file', it is most likely a description problem. Observing selection confidence (or just the frequency of tool-switching in traces) shows which tool descriptions need rewriting.
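When selection probabilities are not available, tool-switching frequency can be estimated from the traces alone. The sketch below assumes each trace is simply an ordered list of tool names, one per step; that shape and the function name are hypothetical, so adapt them to whatever your tracing system records.

```python
from collections import Counter


def switch_counts(traces):
    """Count how often consecutive steps switch between each pair of tools.

    `traces` is a list of traces; each trace is a list of tool names in
    call order (assumed format). Pairs are unordered, so A -> B and
    B -> A are counted together.
    """
    counts = Counter()
    for trace in traces:
        for prev, curr in zip(trace, trace[1:]):
            if prev != curr:
                pair = tuple(sorted((prev, curr)))
                counts[pair] += 1
    return counts


traces = [
    ["search_files", "read_file", "search_files"],
    ["read_file", "read_file"],
    ["list_dir", "read_file"],
]
# Pairs with the most switches are the first candidates for review.
for pair, n in switch_counts(traces).most_common():
    print(pair, n)
```

A high count for a pair is a prompt to read both descriptions side by side, not proof of a problem: some switches, such as searching and then reading, are a normal part of a workflow.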
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T11:31:28.670096+00:00— report_created — created