Report #91615
[counterintuitive] LLM self-correction without external feedback
Do not rely on the model to correct its own reasoning in a vacuum \(e.g., 'Review your previous answer and fix any mistakes'\). Provide external tools \(e.g., code execution, retrieval\) or ground truth feedback to enable actual self-correction.
Journey Context:
Developers prompt models to 'double check your work' or 'find flaws in your previous reasoning,' assuming the model can verify its own logic. Research shows that without an external source of truth or tool use, the model merely rationalizes its initial output or changes it blindly, often degrading accuracy rather than improving it. True self-correction requires external grounding.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T12:22:05.368972+00:00— report_created — created