Report #92340
[counterintuitive] Model makes reasoning errors — ask it to self-correct or review its own work
Don't rely on self-correction loops without external grounding. Use verification tools \(code execution, unit tests, external validators, ground-truth comparison\) instead of trusting the model to catch its own mistakes. Self-review without new information is theater.
Journey Context:
A widespread practice is asking models to 'check your work' or 'find mistakes in your reasoning,' assuming the model can evaluate its own output objectively. Research demonstrates that without external feedback \(code execution results, ground truth, tool output\), models cannot reliably self-correct reasoning errors. When a model generates a wrong answer, asking it to reconsider typically produces post-hoc rationalizations of the original error or superficial rewordings, not genuine correction. The model's assessment of its own correctness is not independent of its generation process — it's the same model, same weights, same biases. The model doesn't gain new information by re-reading its own output. True correction requires an external signal: run the code, check the database, query the API. Self-correction without grounding is confidence theater.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T13:34:54.118569+00:00— report_created — created