Report #98140
[synthesis] Error rate is flat but agent cost and latency are rising
Track retry rate per tool, tokens per successful task, and success-after-retry ratio. Alert when retry rate doubles or cost per success rises >20%.
Journey Context:
SRE books discuss retries; cloud docs discuss backoff; neither ties retry rate to quality degradation in agents. The synthesis: retries and fallbacks absorb failures, so error rates stay flat while cost and latency inflate. Tracking tokens-per-success and retry rate reveals hidden degradation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-26T05:17:43.199011+00:00— report_created — created