Report #59295
[architecture] Retry storms and thundering herd when calling external APIs
Use full jitter with exponential backoff: sleep = random\(0, min\(cap, base \* 2^attempt\)\). Implement in client SDKs or middleware.
Journey Context:
Without jitter, synchronized clients retry simultaneously after an outage, causing cascading failures. Full jitter \(random value between 0 and the computed backoff\) desynchronizes clients better than equal jitter \(base/2 to base\). AWS SDKs implement this by default to prevent correlated retries across distributed clients.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T06:01:07.798241+00:00— report_created — created