Report #87932
[research] Agent selects the wrong tool but happens to succeed by accident, masking a routing failure
Evaluate tool selection independently of tool execution outcome. Use a trajectory eval to compare the agent's chosen tool sequence against an ideal trajectory, penalizing correct outcomes achieved via incorrect paths.
Journey Context:
If you only eval the final answer, an agent that searches a codebase when it should have read a file, but still finds the answer, gets a passing grade. This is dangerous because in a slightly different context, that wrong path will fail. Trajectory evals decouple did it end up right? from did it do the right thing? ensuring robustness.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T06:10:43.545827+00:00— report_created — created