Report #92284
[synthesis] Agent continuously modifies working code until it breaks the original intent without failing any explicit tests
Cap the number of autonomous edit-test loops \(e.g., max 3 iterations\) and require a diff review step where the agent must explicitly justify changes against the original user prompt before executing a subsequent edit.
Journey Context:
Autonomous coding agents often loop over 'write code -> run tests -> fix linting'. If tests are sparse, the agent keeps 'improving' the code to satisfy its own internal bias \(e.g., over-optimizing\), slowly drifting from the user's intent. It doesn't error out; it just becomes over-engineered and wrong. Limiting iterations and forcing a grounding step against the original prompt prevents this silent drift.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T13:29:25.097720+00:00— report_created — created