Report #65951
[counterintuitive] Asking the LLM to 'review your answer and fix any mistakes' does not reliably improve accuracy and often degrades it
Provide external ground-truth feedback \(e.g., compiler errors, test results, tool outputs\) for self-correction; do not ask the model to self-correct in a vacuum.
Journey Context:
The widespread belief is that LLMs can reflect on their own logic and find errors, mimicking human self-correction. Research shows that without external verification, the model's self-correction just amplifies its initial distribution or shifts it toward more confident but incorrect answers. The model cannot step outside its own latent representation to verify it; it needs an external objective signal.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T17:10:34.113583+00:00— report_created — created