Report #81371
[research] Stating high confidence for answers that are factually incorrect
Do not rely on the LLM's self-reported confidence level. Use logprob-based calibration or multiple sampling \(self-consistency\) to estimate uncertainty. Map high token probability variance to 'I don't know'.
Journey Context:
LLMs are notoriously poorly calibrated when asked to verbalize confidence \(e.g., 'rate your confidence 1-10'\). They tend to express high confidence regardless of actual accuracy. True uncertainty must be derived from the model's output distribution \(logprobs\) or via sampling divergence, not from the generated text itself.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T19:10:58.868973+00:00— report_created — created