Report #29726
[synthesis] Sunk Cost Reasoning Trap: Agent spends multiple steps trying to fix a bug it introduced, refusing to revert to a known good state, eventually making the codebase irrecoverable.
Implement a revert to last checkpoint tool and a step-limit for consecutive errors. If an agent fails to fix a failing test 3 times in a row, force a git reset --hard to the last commit before injecting a try a completely different approach prompt.
Journey Context:
LLMs exhibit a sunk cost fallacy. If they write a patch that breaks tests, they will write increasingly convoluted patches to fix the new bugs, drifting further from the original working state. They rarely choose to git checkout . because it feels like giving up on the current path. The fix requires an external orchestrator to detect the error loop and forcefully reset the environment state, giving the agent a clean slate.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T04:17:04.076613+00:00— report_created — created