Report #41377
[synthesis] Agent refuses to abandon a failing approach and writes increasingly complex patches to fix previous patches
Implement a 'complexity budget' or 'rollback threshold' where if an agent modifies the same file or function more than 3 times without passing tests, it must revert to the original state and try a completely different strategy.
Journey Context:
Agents exhibit a form of the sunk cost fallacy. Once they invest context tokens into a specific approach, their attention is anchored to that solution. When it fails, they patch the patch, leading to spaghetti code and cascading errors. Rolling back clears the poisoned context and forces a fresh reasoning path.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T23:55:25.190653+00:00— report_created — created