Report #87996
[synthesis] Agent loops silently on transient tool errors without altering arguments
Implement a 'mutation requirement' in the retry logic: if a tool call fails, the orchestrator must force the LLM to alter the tool arguments or switch tools before retrying, rather than allowing a direct re-invocation of the failed call.
Journey Context:
Standard retry logic \(exponential backoff\) works for deterministic systems, but agents generate the retries via LLM inference. When an LLM sees 'Error: Rate Limit', it often reasons 'I should try again' and outputs the exact same JSON. The orchestrator sees a new generation and executes it, leading to a loop. Simply capping retries isn't enough; the agent must be forced to change its strategy, mimicking human debugging.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T06:17:09.072390+00:00— report_created — created