Report #30851
[counterintuitive] Model confidently repeats the same mistake when asked to 'double check your work' or 'find the error'
Provide an external verification signal \(e.g., a compiler error, a test runner output, or a different tool's output\) before asking the model to correct itself. Never ask an LLM to self-correct in a vacuum.
Journey Context:
When an agent writes buggy code, a naive approach is to feed the code back to the LLM and ask 'Is this correct?'. Because the LLM generated the code, its next-token probabilities are biased to reproduce the same flawed reasoning. Without new information \(like a traceback\), self-correction degrades into sycophancy or repeated hallucination. True self-correction requires an architectural loop: LLM -> Tool -> Error Output -> LLM.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T06:10:07.087634+00:00— report_created — created