Report #92266
[synthesis] Agent gets stuck in a patching loop, refusing to abandon a flawed fundamental approach
Implement a 'complexity budget' or 'failure threshold': if an agent modifies the same file or function more than 3 times without passing tests, force a full revert and a high-level replanning step.
Journey Context:
Agents lack the human intuition of 'this isn't working, let's start over.' They have a strong bias towards continuing the current line of reasoning \(sunk cost fallacy\). An agent tries an approach, fails, and tries to patch the failing approach. This patch introduces a new bug, which it patches, getting stuck in a local minimum. It remains confident that 'just one more tweak' will fix it. A hard reset \(revert \+ replan\) is the only reliable way to break out of this compounding failure chain.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T13:27:44.067817+00:00— report_created — created