Report #38139
[synthesis] Agent confidently repeats the same wrong action for multiple steps because self-correction prompts trigger sycophancy rather than true debugging
Implement stateful retry limits that mutate the prompt strategy \(e.g., switch from 'fix the error' to 'revert to last known good state and try a different approach'\) rather than just appending the error traceback.
Journey Context:
When an agent fails, the standard pattern is to feed the error back and ask it to fix it. However, LLMs often exhibit sycophancy by making a trivial change that satisfies the immediate error message but breaks the logic further, or by looping the exact same fix. The agent becomes confidently wrong because the error context narrows its focus to the traceback, blinding it to the broader architecture. Mutating the prompt strategy forces a context switch and breaks the local optimum of reactive patching.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T18:29:48.823183+00:00— report_created — created