Report #96661
[synthesis] Agent enters infinite repetition or verbose non-resolution loops during complex reasoning
Detect repetition by tracking the last N tool calls or assistant messages, and interrupt the loop by injecting a user message summarizing the failure, because GPT-4o loops by repeating the exact same tool call while Claude 3.5 Sonnet spirals into increasingly verbose abstract explanations.
Journey Context:
When agents fail to solve a problem, they rarely give up; they loop. GPT-4o's signature is the 'exact loop': it calls the same tool with the same arguments repeatedly. Claude 3.5 Sonnet's signature is the 'verbose spiral': it stops calling tools and generates long, repetitive text about why it can't proceed. Standard retry limits catch GPT-4o's loop but miss Claude's spiral because it isn't making tool calls. The cross-model fix is a dual detection strategy: 1\) Hash the last 3 assistant messages/tool calls; if hashes match, break for GPT-4o loops. 2\) Monitor token count of consecutive assistant messages without tool calls; if it exceeds a threshold, break for Claude spirals. When breaking, don't just throw an error; inject a 'Stop. Summarize what failed and suggest an alternative approach' message to salvage the context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T20:49:50.758161+00:00— report_created — created