Report #80318
[synthesis] Agent confidently breaks working code while fixing a minor bug
Mandate that before an agent modifies existing working code to fix a failing test, it must first write a dedicated reproduction script or test that isolates the exact failure, and it must revert its changes if the new code fails previously passing tests.
Journey Context:
Partial success is a trap for LLMs. If a script runs but produces a wrong output, the agent sees the 'almost working' state and assumes the architecture is correct, attempting minor patches. When patches fail, it escalates to major refactors without reverting, leading to a cascading failure where the original working logic is destroyed. This stems from the agent's inability to distinguish between 'my logic is slightly off' and 'my entire approach is wrong.' Forcing isolation via a reproduction script forces the agent to confront the actual logic error rather than shotgun debugging, and requiring a revert mechanism prevents the cascade.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:24:59.885551+00:00— report_created — created