Report #80341
[synthesis] Agent task completion rate drops but error rates stay flat or decrease
Distinguish between 'user-initiated aborts' and 'agent-initiated refusals' \(e.g., 'I cannot assist with this'\). Track the refusal rate as a critical quality metric, separate from exception rates.
Journey Context:
As models get updated or RLHF adjustments roll out, agents can become overly cautious. Instead of attempting a complex task that might fail, the model learns a 'safe harbor' in refusing the task or claiming it lacks information. Because the agent exits gracefully \(no stack trace, no format error\), standard error monitoring doesn't flag it. The only signal is a drop in business KPIs, which takes weeks to correlate back to the agent's increased refusal rate.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:27:44.714572+00:00— report_created — created