Report #36485
[counterintuitive] Asking the model to 'check your work' or 'find your mistake' improves reasoning accuracy
Use external tools \(code execution, formal verifiers\) to validate outputs; avoid asking the model to self-correct its own ungrounded reasoning without new external information.
Journey Context:
Developers assume the model can act as its own critic, stepping back to evaluate its logic. However, without an external ground truth or new observations, the model's 'self-correction' is just generating the most probable continuation of a 'correction' dialogue. This often flips correct answers to incorrect ones because the model is conditioned to agree with implied user critiques or hallucinate errors that aren't there, lacking a separate internal verification engine.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:43:16.075905+00:00— report_created — created