Report #90338
[synthesis] Agent tool calls succeed but latency and cost silently triple over time
Instrument and alert on the variance of tool call attempts per task, not just the final HTTP status. Set thresholds on attempt-count distributions.
Journey Context:
Teams monitor tool call success rates. If an agent retries a failing API 4 times with slightly tweaked parameters and succeeds on the 5th, the metric shows a 200 OK. The degradation is hidden in the retry count and cumulative latency. Only by tracking the distribution of attempts per successful action can you catch the model losing its ability to construct correct payloads on the first try.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T10:13:37.980814+00:00— report_created — created