Report #87362
[counterintuitive] Why asking the LLM to review its own answer does not fix reasoning errors
Implement external verification (e.g., unit tests, code execution, formal logic checkers) rather than relying on the LLM to self-correct its own ungrounded reasoning in the same context window.
Journey Context:
It seems intuitive that an LLM can reflect on its output and find mistakes, much like a human. However, without an external ground truth, self-correction degenerates into sycophancy or amplifies the initial flawed reasoning. The first answer is already what the model judged most likely given the prompt, and a review request in the same context is conditioned on that answer. Asking it to 'find the error' therefore tends to generate plausible-sounding post-hoc justifications for the output it already produced. True self-correction requires an external state update: new information from outside the model, such as a failing test or an execution result.
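As an illustration of the external state update described above, here is a minimal sketch in Python. It assumes a hypothetical `generate` callable standing in for whatever LLM client is in use (it takes a prompt string and returns Python source). The names `run_checks` and `solve_with_verification` are also made up for this example. The point is that the retry prompt carries a concrete test failure from outside the model, not a request to "review your answer".

```python
from typing import Callable, Optional

# Hypothetical stand-in for an LLM call: prompt in, Python source out.
Generator = Callable[[str], str]
# Each case is (positional arguments, expected return value).
Case = tuple[tuple, object]


def run_checks(source: str, cases: list[Case], func_name: str) -> Optional[str]:
    """Run candidate code against test cases.

    Returns None if every case passes, otherwise a description of the
    first failure. This is the external ground truth.
    """
    namespace: dict = {}
    try:
        # WARNING: exec runs arbitrary code. Sandbox untrusted output in real use.
        exec(source, namespace)
        func = namespace[func_name]
        for args, expected in cases:
            actual = func(*args)
            if actual != expected:
                return f"{func_name}{args} returned {actual!r}, expected {expected!r}"
    except Exception as exc:
        return f"{type(exc).__name__}: {exc}"
    return None


def solve_with_verification(
    generate: Generator,
    task: str,
    cases: list[Case],
    func_name: str,
    max_attempts: int = 3,
) -> Optional[str]:
    """Regenerate until the candidate passes the external checks or attempts run out."""
    prompt = task
    for _ in range(max_attempts):
        candidate = generate(prompt)
        failure = run_checks(candidate, cases, func_name)
        if failure is None:
            return candidate
        # Feed back the observed failure, not a generic "find the error" request.
        prompt = (
            f"{task}\n\nPrevious attempt:\n{candidate}\n\n"
            f"Test failure:\n{failure}"
        )
    return None
```

A caller would pass its own LLM wrapper as `generate`. If no candidate passes within `max_attempts`, the function returns `None` so the failure is visible rather than silently accepted.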
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T05:13:34.262135+00:00— report_created — created