Report #5276
[research] How do I measure uncertainty when the model can phrase the same answer many ways?
Compute semantic entropy: sample multiple answers to the same prompt, cluster them by semantic equivalence (e.g., with an NLI model), and measure entropy over the meaning clusters rather than over the individual strings. High semantic entropy flags likely hallucinations; low entropy indicates the answers agree in meaning, which tends to go with more reliable output.
Journey Context:
Token-level entropy is misleading for free-form text because it counts different phrasings of the same answer as different outcomes, so uncertainty about wording gets mistaken for uncertainty about meaning. Kuhn et al. introduce semantic entropy, which groups paraphrases into one cluster and places contradictory answers in different clusters. They report that it outperforms lexical and raw-probability baselines for hallucination detection in question answering.
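The idea above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the method exactly as published: cluster probabilities are estimated from sample counts, clustering is greedy against the first member of each cluster, and `toy_equivalent` is a hypothetical keyword-based stand-in for a real NLI equivalence check. All function names are illustrative.

```python
import math
from collections import Counter
from typing import Callable, List, Sequence


def cluster_by_meaning(
    answers: Sequence[str],
    equivalent: Callable[[str, str], bool],
) -> List[List[str]]:
    """Greedily group answers; each answer joins the first cluster whose
    representative (first member) it is judged equivalent to."""
    clusters: List[List[str]] = []
    for answer in answers:
        for cluster in clusters:
            if equivalent(cluster[0], answer):
                cluster.append(answer)
                break
        else:
            clusters.append([answer])
    return clusters


def entropy_from_counts(counts: Sequence[int]) -> float:
    """Shannon entropy (in nats) of the empirical distribution given by counts."""
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    return sum(-p * math.log(p) for p in probs)


def semantic_entropy(
    answers: Sequence[str],
    equivalent: Callable[[str, str], bool],
) -> float:
    """Entropy over meaning clusters (count-based estimate, an assumption)."""
    clusters = cluster_by_meaning(answers, equivalent)
    return entropy_from_counts([len(c) for c in clusters])


def lexical_entropy(answers: Sequence[str]) -> float:
    """Baseline: entropy over exact strings, so every paraphrase is its own outcome."""
    return entropy_from_counts(list(Counter(answers).values()))


def toy_equivalent(a: str, b: str) -> bool:
    """HYPOTHETICAL stand-in for an NLI-based check (e.g. entailment in both
    directions). It only recognises answers that mention 'paris'."""
    def key(s: str) -> str:
        return "paris" if "paris" in s.lower() else s.lower()
    return key(a) == key(b)


if __name__ == "__main__":
    samples = ["Paris", "It's Paris.", "The capital is Paris", "Lyon"]
    print("lexical entropy :", lexical_entropy(samples))
    print("semantic entropy:", semantic_entropy(samples, toy_equivalent))
```

On these four samples the lexical baseline sees four distinct strings, while the semantic version sees one cluster of three paraphrases and one dissenting answer, so its entropy is lower. In practice the `equivalent` callable would wrap an NLI model, and the number of samples and the sampling temperature are tuning choices the report does not specify.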
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T20:57:41.546910+00:00— report_created — created