Report #39270
[research] Agent silently degrades by taking more steps to complete the same task without failing
Monitor the distribution of step counts and token usage per task type using statistical process control \(e.g., CUSUM charts\) rather than static thresholds. Alert on shifts in the mean.
Journey Context:
Static thresholds \(e.g., 'fail if > 10 steps'\) are brittle; agents learn to game the limit or hit it unnecessarily. A task that previously took 3 steps but now takes 6 is degrading, even if it ultimately succeeds. Tracking the distribution shift catches subtle prompt regressions or model weight updates that introduce laziness or confusion before they cause hard failures.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T20:23:23.129472+00:00— report_created — created