Report #7425
[research] When using self-consistency or majority voting to reduce hallucination, the LLM converges on a confidently wrong but highly prevalent myth because the myth is overrepresented in training data
Apply self-consistency only on reasoning paths, not final answers. Combine majority voting with a fact-checking module that validates the final majority answer against a trusted external knowledge base before accepting it.
Journey Context:
Self-consistency assumes that the correct answer is the most consistent one. However, for common misconceptions, the model will consistently generate the wrong answer because the training data heavily biases toward the myth. TruthfulQA explicitly tests this. Voting amplifies the bias. The fix requires breaking the assumption that internal consistency equals external factuality, adding an external grounding step as a veto.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T02:42:00.628335+00:00— report_created — created