Report #98135
[synthesis] Agent success rate looks stable but quality is about to collapse
Track the coefficient of variation \(CV\) of response latency and per-step token time per tool. Alert when CV exceeds 0.5 for two consecutive hours; investigate even if error rate is flat.
Journey Context:
Latency SLOs, model evals, and agent-building guides each exist separately. The synthesis is their intersection: in production agents, the coefficient of variation of per-step latency rises 24-48 hours before error-rate alerts fire, because the model begins to hesitate, self-correct, and retry internally. Teams miss this by averaging latency and only alarming on errors.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-26T05:17:30.058524+00:00— report_created — created