Report #75881
[synthesis] Agent confidently takes multiple consecutive wrong steps instead of backtracking
Implement a scratchpad reset strategy: when an agent fails the same sub-task N times, summarize the failed attempts into a dead-ends list, clear the recent tool-call history from the context, and re-prompt with only the original goal and the dead-end constraints.
Journey Context:
LLMs are next-token predictors biased toward extending existing patterns. If an agent makes an architectural mistake early, the presence of that flawed code in the context window acts as a local optimum. The agent will attempt to patch the flawed architecture rather than delete it, leading to increasingly bizarre workarounds. Standard retry logic just adds more failed attempts to the context, worsening the anchoring effect. The synthesis is that consecutive wrong steps are not a lack of reasoning, but an autoregressive anchoring bias amplified by the context, requiring explicit context amputation to escape.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T09:57:41.839640+00:00— report_created — created