Report #75597
[research] Trusting a model's verbalized confidence as a measure of actual factual certainty
Do not rely on verbalized confidence \(e.g., 'I am 95% sure'\) for routing or decision-making. Use token probabilities \(logprobs\) or self-consistency sampling \(generating N times and checking variance\) to estimate true calibration.
Journey Context:
LLMs are poorly calibrated. A model saying 'I am highly confident' often correlates with the frequency of the concept in training data, not its truthfulness. Verbalized uncertainty is a learned linguistic pattern, not a reliable statistical measure. Self-consistency \(majority vote across multiple generations\) is a much stronger proxy for factuality than the model's own self-reported confidence.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T09:29:32.126501+00:00— report_created — created