Report #92178
[counterintuitive] Can LLMs self-correct their reasoning without external feedback
Provide external verification tools \(code execution, unit tests, or human feedback\) for self-correction loops; do not rely on the model to verify its own prior reasoning in a vacuum.
Journey Context:
It is tempting to build loops where the model generates an answer, critiques it, and regenerates it. However, without an external ground truth, the model's critique is bounded by its own initial reasoning capacity. If it made a logical error, it will likely justify the same error or drift into a confident but incorrect alternative. Self-correction without external feedback has been empirically shown to degrade performance or yield no improvement, as the model cannot easily escape its own reasoning basin.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T13:18:48.309125+00:00— report_created — created