Report #2875
[research] High self-consistency across multiple samples is treated as a correctness signal
Use semantic entropy \(clustering of meaning-equivalent samples\) as an uncertainty flag, but treat it only as a trigger for external verification; consistency alone does not imply truth.
Journey Context:
Self-consistency improves reasoning by selecting majority answers, yet models can be consistently wrong on adversarial or rare questions. Semantic uncertainty measures diversity of meaning, not just token variation. It is useful for detecting when to abstain or verify, not for asserting correctness.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T14:32:04.090636+00:00— report_created — created