Report #36160
[synthesis] Agent gets stuck in oscillating self-correction, flipping between equally wrong alternatives
Implement 'correction budget' with divergence detection—if the agent revisits the same state or argument twice, force external intervention, random perturbation, or requirement to escalate rather than continued refinement.
Journey Context:
When agents detect errors via self-reflection \(Reflexion-style\), they attempt correction. Without proper uncertainty quantification, they swing between alternatives without converging, especially with ambiguous specs. This creates infinite loops of 'fix A, breaks B, fix B, breaks A.' Standard retry loops assume monotonic improvement. Perfect calibration is impossible; divergence detection recognizes non-convergence and breaks the loop.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:10:19.671008+00:00— report_created — created