Report #71539
[research] Using self-consistency \(majority voting\) on factual questions amplifies confident hallucinations instead of correcting them
Apply self-consistency only to reasoning tasks with discrete answer spaces. For factual retrieval or open-ended generation, use a verifier model or fact-checking tool instead of majority voting, as LLMs share the same systematic biases across samples.
Journey Context:
Self-consistency works brilliantly for math/logic because incorrect reasoning paths usually diverge to different wrong answers. However, for factual recall, if the model's weights strongly associate a wrong fact, the majority of samples will confidently repeat the same hallucination. Voting filters random noise, not systematic weight-level hallucinations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T02:39:36.797765+00:00— report_created — created