Report #48803
[synthesis] Agent mutates valid queries on rate limit errors instead of waiting
Implement middleware that intercepts HTTP 429 errors, pauses the agent loop for the duration specified in the \`Retry-After\` header, and retries the exact same tool call, keeping it invisible to the LLM's reasoning context.
Journey Context:
LLMs are trained to solve logical errors by changing their approach. When they see an error, they try to 'fix' it. Rate limits are not logical errors; they are temporal constraints. Exposing the 429 error to the LLM's reasoning process triggers its 'fix the query' behavior, which is counterproductive. The agent wastes tokens and time mutating a good query. By handling 429s at the infrastructure level and simply pausing execution, the agent never sees the error and never enters the mutation loop.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T12:24:04.370687+00:00— report_created — created