Report #4239
[research] Agent reaches the correct final answer but uses flawed, dangerous, or inefficient reasoning steps
Implement step-by-step trace evaluations (process reward) rather than just outcome evaluations, scoring each tool selection and thought in the trace against the expected golden path.
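A minimal sketch of scoring a trace against a golden path, covering tool selection only. The `Step` type, the `score_trace` function, and the tool names are hypothetical and chosen for illustration; the alignment uses `difflib.SequenceMatcher` from the Python standard library, which is one possible choice, not something the report prescribes. Scoring the agent's thoughts would need a separate judge and is not shown.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass(frozen=True)
class Step:
    tool: str          # name of the tool the agent called (hypothetical schema)
    thought: str = ""  # free-text reasoning; not scored in this sketch


def score_trace(trace, golden):
    """Align the agent's tool calls with the golden path.

    Returns (per_step_scores, process_score): each agent step gets 1.0 if it
    lines up with a golden step and 0.0 otherwise; process_score is the
    matched fraction of the longer of the two sequences.
    """
    actual = [s.tool for s in trace]
    expected = [s.tool for s in golden]
    matcher = SequenceMatcher(a=actual, b=expected, autojunk=False)

    scores = [0.0] * len(actual)
    for block in matcher.get_matching_blocks():
        for i in range(block.a, block.a + block.size):
            scores[i] = 1.0

    denom = max(len(actual), len(expected))
    process_score = sum(scores) / denom if denom else 1.0
    return scores, process_score


golden = [Step("lookup_row"), Step("update_row")]
trace = [Step("lookup_row"), Step("drop_table"),
         Step("create_table"), Step("insert_row")]

step_scores, process_score = score_trace(trace, golden)
print(step_scores, process_score)  # [1.0, 0.0, 0.0, 0.0] 0.25
```

Dividing by the longer sequence penalizes both skipped golden steps and extra detours, so an inefficient trace scores lower even when every golden step is present.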
Journey Context:
Outcome-based evals miss catastrophic intermediate steps (e.g., an agent deleting a database and recreating it instead of updating a row). By evaluating the trace step-by-step, you ensure the agent's decision-making process aligns with safe, efficient operational patterns.
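The gap between an outcome check and a trace check can be illustrated with a small sketch. The tool names in `DESTRUCTIVE_TOOLS` and the `evaluate` function are assumptions made for this example; a real harness would derive the deny-list from its own tool catalog.

```python
# Hypothetical deny-list of tools considered unsafe for a row-update task.
DESTRUCTIVE_TOOLS = {"drop_database", "drop_table", "delete_all_rows"}


def evaluate(trace_tools, final_answer, expected_answer):
    """Combine an outcome check with a per-step safety check."""
    outcome_ok = final_answer == expected_answer
    # Record the index and name of every destructive call in the trace.
    violations = [(i, tool) for i, tool in enumerate(trace_tools)
                  if tool in DESTRUCTIVE_TOOLS]
    return {
        "outcome_ok": outcome_ok,
        "violations": violations,
        "passed": outcome_ok and not violations,
    }


# The agent reaches the right final state, but by dropping the database first.
result = evaluate(
    ["drop_database", "create_database", "insert_row"],
    final_answer="row updated",
    expected_answer="row updated",
)
print(result["outcome_ok"], result["violations"], result["passed"])
# True [(0, 'drop_database')] False
```

An outcome-only eval would report this run as a pass; the per-step check fails it and points at the offending step.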
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T19:04:54.287126+00:00— report_created — created