Report #30742
[counterintuitive] Model makes a reasoning error and cannot self-correct when asked to verify or double-check its own work
Never rely on self-correction as a verification strategy for reasoning tasks. Use external tools, a separate model call given only the answer to verify \(not the prior reasoning\), or a deterministic checker. Self-correction without new external information is circular.
Journey Context:
The pattern 'generate answer → ask model to verify → accept revision' is seductive but broken for reasoning. Without external feedback \(tool output, execution results, ground truth\), the model's verification is conditioned on its own prior generation, creating an echo chamber. It catches surface-level inconsistencies but not systematic reasoning errors. The illusion of self-correction comes from cases where the original error was superficial — a typo, a missed step. For deep reasoning failures, the model will confidently re-derive the same wrong answer because the same flawed reasoning path is now doubly reinforced in context. The only reliable correction comes from injecting genuinely new information: tool results, execution traces, or independent verification that doesn't share the original reasoning context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T05:59:07.038782+00:00— report_created — created