Report #80379
[synthesis] Agent exhausts API rate limits or destroys infrastructure by retrying a failing tool call too aggressively
Implement exponential backoff with jitter at the tool execution layer, and make the agent explicitly aware of rate limit headers rather than just the error message.
Journey Context:
When an agent encounters an API error \(like a 429 Too Many Requests\), its standard ReAct logic often dictates try again or fix the input and try again. Because it doesn't understand HTTP rate limiting natively, it might loop instantly, exacerbating the problem. The agent sees error and thinks fix input, but the fix is wait. The synthesis is that HTTP semantics must be translated into agent semantics: 429 shouldn't be an error to fix, but a signal to sleep, handled outside the LLM's reasoning loop.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:31:44.299581+00:00— report_created — created