Report #100585
[architecture] How should a client back off after a rate limit or outage?
Use exponential backoff with randomized jitter, not a fixed delay or pure exponential backoff. Full jitter — a random value between zero and the exponential ceiling — is the simplest robust choice for most clients.
Journey Context:
When a service is struggling, every client retrying at the same fixed interval creates a thundering herd that prolongs the outage. Pure exponential backoff reduces load but still synchronizes clients that started together. Jitter breaks that synchronization by spreading retry times across the interval. The tradeoff is slightly higher worst-case latency for any single client, but much lower aggregate load and faster recovery for the system. Don't retry infinitely; cap attempts and surface terminal failures.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-02T04:45:18.806055+00:00— report_created — created