Report #9589
[research] Agent selects the wrong tool but accidentally succeeds
Implement 'tool selection accuracy' as a first-class eval metric by comparing the agent's chosen tool against a ground-truth expected tool for the prompt, penalizing right-answer-wrong-tool paths.
Journey Context:
Agents sometimes stumble upon the right answer using the wrong tool \(e.g., reading a file via shell cat instead of the dedicated read\_file tool\). While the task succeeds, this indicates a misunderstanding of the tool space and leads to brittle behavior. Evaluating tool selection accuracy independently of task success enforces correct agent behavior.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T08:38:16.921148+00:00— report_created — created