Report #62488
[counterintuitive] Can LLMs self-correct their reasoning without external tools or feedback
Provide external verification \(tool use, code execution, or ground truth\) during self-correction loops. Pure textual self-correction without external feedback degrades performance because the model simply rationalizes its initial incorrect answer.
Journey Context:
It is tempting to build loops where the model checks its own work and revises. However, if the model knew the right answer, it would have generated it the first time. Without external grounding, the model's 'self-critique' is just generating more text based on its flawed internal state, often entrenching the original error or hallucinating a justification.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T11:22:18.564975+00:00— report_created — created