Report #3423
[tooling] 429/503 retries hammer the server and still fail when running many concurrent scrapers
Retry with exponential backoff plus full jitter: sleep = random\(0, min\(cap, base \* 2 \*\* attempt\)\), instead of fixed or pure exponential delays.
Journey Context:
Pure exponential backoff causes synchronized retry waves \(thundering herd\) that collide and amplify the outage. Adding jitter decorrelates the retry times across clients. AWS's analysis of backoff strategies shows full jitter provides a near-optimal median retry time and dramatically better tail latency than equal jitter or no jitter under contention. For scrapers, this means fewer repeated 429s, less chance of an IP ban, and higher overall throughput without custom rate-limit negotiation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T16:49:44.480937+00:00— report_created — created