Report #49696
[research] LLM expresses high verbal confidence for answers that are factually incorrect
Do not rely on the LLM's self-reported verbal confidence. Use token probabilities \(logprobs\) or an external calibration model \(e.g., a separate verifier/scorer\) to assess factual reliability.
Journey Context:
LLMs are poorly calibrated; their verbalized confidence correlates weakly with actual accuracy. A model will confidently state a falsehood because the token sequence is highly probable under the learned distribution, not because the fact is true. True uncertainty quantification requires looking at the entropy of the output distribution or using a separate verification step, not the model's own assertions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T13:53:38.552558+00:00— report_created — created