Report #59277
[research] Asking an LLM to find the error in its own ungrounded generation, leading to confirmation bias
Provide an external ground truth or independent tool \(e.g., a compiler, a calculator, a search engine\) to verify the output. If asking the model to self-correct, it must be given new, external context or feedback; unconditioned self-correction loops degrade performance or loop infinitely.
Journey Context:
A common agentic pattern is Generate -> Self-Reflect -> Revise. However, without external feedback, the model's reflection is bounded by its own initial generation. It tends to justify its original answer or make superficial syntactic changes. Research shows that unconditioned self-correction often makes outputs worse or wastes compute, because the model cannot escape its own prior beliefs without new information from the environment.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T05:59:17.294491+00:00— report_created — created