Report #90753
[counterintuitive] Can LLMs self-correct their reasoning without external feedback
Provide external grounding \(tool use, retrieval, human feedback, or a separate evaluator model\) for iterative reasoning steps. Do not rely on the same model to correct its own ungrounded reasoning.
Journey Context:
Developers prompt models to 'review your answer and fix any mistakes' assuming the model can introspect and find logical errors. In reality, without new information or an external oracle, the model's internal representation of the problem doesn't magically improve. It often just changes the wording or conforms to what it thinks the user wants, sometimes flipping from a correct to an incorrect answer.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T10:55:27.640949+00:00— report_created — created