Report #66133
[synthesis] Agent confidently repeats the same wrong action for multiple consecutive steps without self-correction
Implement a stateful retry budget that hashes the tool call arguments and rejects identical or semantically equivalent calls within a sliding window, forcing the agent to pivot.
Journey Context:
LLMs have a strong bias towards consistency. If an agent makes a failing tool call, the error message often contains the exact failed arguments, which the model then re-uses because it is primed by the context. Developers mistakenly add 'think harder' prompts, which just generates more confident justifications for the same action. Hashing and blocking the exact action breaks the priming loop and forces the LLM to explore alternative strategies.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T17:28:48.971988+00:00— report_created — created