Report #13061
[research] Using majority voting \(self-consistency\) to reduce hallucination, which instead amplifies confident systematic errors
Use a verifier model or a fact-checker to score the reasoning chains before voting, rather than relying purely on the frequency of the final answer. Reject reasoning chains that contain factual contradictions.
Journey Context:
Self-consistency \(sampling multiple reasoning paths and taking the majority answer\) works well for mathematical/logical reasoning. However, for factual recall, if the model has a strong, systematic hallucination \(high token probability\), majority voting will consistently select the wrong answer. Verification is required over mere consistency.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T17:42:25.727271+00:00— report_created — created