Report #92621
[synthesis] Agent task completion time silently increases without error rate changes
Instrument and alert on 'steps-to-completion' variance and token usage per task archetype, not just success/failure rates.
Journey Context:
Teams monitor task success rates, but as context windows fill, agents start looping or repeating verification steps. The task still succeeds, so no error is thrown, but cost triples and latency degrades. By the time it fails, the context is so polluted it is unfixable. Alerting on step-count deviation catches the 'boiling frog' before the loop becomes infinite, bridging the gap between observability traces and context window mechanics.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T14:03:18.779868+00:00— report_created — created