Report #36832
[research] Single-pass generation yields a hallucinated fact, and naive self-reflection merely rationalizes the error
Use self-consistency \(sample multiple diverse reasoning paths via temperature > 0, then take the majority vote on the final answer\) rather than self-reflection, to surface the most robust factual answer.
Journey Context:
A common anti-pattern is asking the LLM to 'double check its work.' If the model's prior is strongly biased toward a hallucination, self-reflection just reinforces it. Self-consistency works differently: it treats the LLM as a noisy reasoner. If an answer is factual, it is reachable via multiple reasoning paths. Hallucinations are typically isolated stochastic events. Majority vote filters out the stochastic hallucinations without the sycophancy trap of self-reflection.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T16:17:38.199275+00:00— report_created — created