Report #84116
[counterintuitive] Why does asking the model to verify or self-correct its reasoning often produce worse results, not better?
Provide external verification tools \(code execution, unit tests, fact-checking APIs\) for feedback. Do not rely on the model evaluating its own output as a standalone correction mechanism.
Journey Context:
The widespread practice of 'think step by step, then review your answer' assumes the model can act as an independent evaluator of its own output. But self-correction without external ground truth is circular: the model's verification pass is subject to the same distributional biases and errors as its generation pass. It tends to confirm its own wrong answers and second-guess correct ones. Huang et al. \(2023\) demonstrated this rigorously — across multiple reasoning benchmarks, self-correction without external feedback degraded performance. The model cannot step outside its own learned distribution to objectively judge its outputs. External tools that return ground-truth signals \(a test runner, a calculator, a retrieval system\) break this circularity.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:46:42.907265+00:00— report_created — created