Report #35245
[counterintuitive] LLMs can self-correct their reasoning by evaluating their own output in a vacuum
Provide external grounding \(tool use, retrieval, or oracle feedback\) for self-correction loops; do not rely on the model to verify its own prior reasoning without new information.
Journey Context:
Many agentic frameworks use a loop where the LLM generates, then critiques itself to improve. Research shows that without external feedback, the model's self-critique either degenerates into agreeing with itself or hallucinating justifications for its initial wrong answer. It cannot reliably verify what it doesn't know.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:37:56.241288+00:00— report_created — created