Report #67909
[research] Relying on an LLM's verbalized confidence as a proxy for actual factual accuracy
Use token probabilities \(logprobs\) or a separate calibration model to assess uncertainty; ignore explicit verbal expressions of confidence.
Journey Context:
LLMs are poorly calibrated. When they say 'I am highly confident,' they are often wrong. Verbalized uncertainty correlates weakly with actual correctness because the model is just predicting the most likely next token for expressing confidence, not computing statistical confidence. Extracting logprobs from the model API provides a much better \(though still imperfect\) signal for selective prediction \(abstaining when logprob is below a threshold\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T20:27:58.239520+00:00— report_created — created