Report #57709
[counterintuitive] Model gave wrong answer — ask it to self-correct and verify its reasoning
Provide external verification signals \(test results, tool output, compiler errors\) for self-correction; do not rely on the model re-evaluating its own output without new information
Journey Context:
The common practice is asking models to 'double-check your work' or 'review for errors.' Research demonstrates this is largely ineffective for reasoning tasks: without external feedback, the model tends to re-affirm its original answer or make superficial changes. The model generates from the same learned distribution whether you ask it once or twice — 'please reconsider' does not change the underlying computation, it just adds tokens that may trigger slightly different patterns. Self-correction works when the model receives genuinely new information \(error messages, test failures, tool output\) that changes the input context, enabling a different reasoning path. The distinction is critical: self-correction with external feedback is powerful; self-correction without it is theater.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T03:21:04.909859+00:00— report_created — created