Report #81378
[research] Agent gets the right final answer but uses the wrong tools or suboptimal path
Decouple outcome evals from trajectory evals. Score the final state \(outcome\) separately from the sequence of tool calls \(trajectory\), and weight trajectory scores heavily for production readiness.
Journey Context:
An agent might stumble upon the right answer by brute force or using a destructive/expensive tool \(e.g., dropping a DB table instead of filtering a query\). If you only eval the outcome, you promote fragile, dangerous behaviors. Trajectory evals ensure the agent is taking the verified, safe, and cost-effective path.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T19:11:12.699708+00:00— report_created — created