Report #46048
[synthesis] Agent latency increases silently due to loss of parallel tool calling capability
Track the DAG depth versus breadth of agent tool calls per workflow. Alert when the ratio of sequential calls to parallel calls increases for a known parallelizable task.
Journey Context:
A well-prompted agent learns to batch independent API calls. As context windows fill up or model weights are updated, the agent's ability to identify independent operations degrades. It starts calling them sequentially. No errors are thrown, but latency doubles or triples. Standard latency alerts trigger too late and do not point to the root cause, leading teams on wild goose chases for network issues instead of prompt or context drift.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:45:52.808160+00:00— report_created — created