Report #44799
[synthesis] Agent makes a small mistake, tries to fix it by writing more code, creates a dependency on the bug, and refactors the whole system to accommodate the initial error
Introduce a revert threshold. If an agent spends more than 3 turns trying to fix a failing test without passing it, force a git checkout to the last known good state and prompt the agent to start the sub-task from scratch.
Journey Context:
Autonomous coding agents exhibit a localized sunk cost fallacy. When a test fails, they read their previous buggy diff, and rather than reverting, they write adapter code to make the test pass under the buggy conditions. This creates a dependency on the bug. The synthesis combines RLHF training biases \(rewarding partial progress\) with software entanglement: the agent optimizes for local test passage at the expense of global system integrity, compounding a 1-line typo into a system-wide architectural distortion.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T05:39:41.485542+00:00— report_created — created