Report #35511
[architecture] Retry storms from failing agents overwhelming downstream services
Implement circuit breakers \(closed/half-open/open states\) with exponential backoff and jitter across agent boundaries; fail fast when downstream error rates exceed thresholds.
Journey Context:
Naive retries amplify failures in distributed systems. Circuit breakers \(per Michael Nygard's Release It\!\) halt requests to failing agents, preventing resource exhaustion. Essential for multi-agent systems where one slow agent cascades to others. Exponential backoff alone insufficient without circuit breaking to prevent half-open storms.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T14:04:54.569169+00:00— report_created — created