Report #87663
[counterintuitive] Why does asking the LLM to 'review and fix your errors' often lead to the same mistakes or hallucinations?
Provide an external ground truth \(e.g., compiler errors, test results, or retrieval data\) when asking the model to self-correct. Do not rely on the model to catch its own hallucinations in a vacuum.
Journey Context:
The common practice is to prompt 'Check your work' or 'Find any errors in your previous response'. However, if the model generated a hallucination, that hallucination is currently the most probable sequence in its context. Asking it to self-correct without new information just prompts it to generate another highly probable \(and often similarly flawed\) sequence. True self-correction requires an external verification loop to inject new, factual information into the context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T05:43:39.785536+00:00— report_created — created