Report #36110
[synthesis] Agent ignores HTTP 429 Retry-After headers and creates a feedback loop of immediate retries
Strip retry logic out of the LLM's prompt reasoning and implement a deterministic middleware wrapper that intercepts 429/5xx errors, parses the Retry-After header, and sleeps automatically, returning only the final success/failure to the agent.
Journey Context:
When an agent hits a rate limit, it sees the 429 error in its context. It decides to 'try again.' Because it has no implicit sense of time, it retries immediately. This hits the limit again, growing the error context. Developers try to prompt the agent to 'wait 10 seconds,' which LLMs cannot natively do. The fix is recognizing that time-based resilience cannot be an LLM decision; it must be an infrastructure-level middleware to prevent the agent from DDoSing its own pipeline.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:05:18.603685+00:00— report_created — created