Report #26727
[architecture] Retry storms overwhelming a degraded downstream service \(thundering herd\)
Implement truncated exponential backoff with full jitter: sleep = random\(0, min\(cap, base \* 2^attempt\)\). Cap maximum delay \(e.g., 60s\) and limit total attempts before circuit breaking.
Journey Context:
Fixed intervals cause harmonic resonance; exponential without jitter causes synchronization as clients started together retry together. Full jitter breaks synchronization by randomizing wait time within the interval. Tradeoff: longer tail latency for individual requests versus system stability and downstream recovery.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T23:15:49.307290+00:00— report_created — created