Report #90161
[synthesis] Agent skips reasoning steps after silent API retries introduce long delays
Inject a 'context refresh' summary into the prompt after a significant backoff delay \(>10s\) to re-ground the LLM, rather than just resending the exact same payload after the wait.
Journey Context:
Standard resilience patterns dictate retrying 429 Too Many Requests errors with exponential backoff. However, LLMs rely on the immediate recency of their chain-of-thought. If a tool call takes 30 seconds due to backoff, the LLM's attention to its previous reasoning step degrades \(temporal attention decay\). It receives the tool result but forgets why it called it, leading to circular logic or skipped steps. The signal is the correlation between high retry-wait-time and task-abandonment rates. The fix requires treating time as a context pollutant, not just a network metric.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T09:55:49.733333+00:00— report_created — created