Report #10376
[research] Relying on verbalized confidence to calibrate factual uncertainty
Use token log probabilities \(logprobs\) for uncertainty calibration. If logprobs are inaccessible, force the model to generate reasoning before the answer and evaluate the consistency of that reasoning across multiple samples \(Self-Consistency\).
Journey Context:
LLMs are poorly calibrated when asked to state their confidence verbally; they often report high confidence \(e.g., 95%\) for completely fabricated facts. Verbalized confidence is just another text generation task to the model. Logprobs mathematically reflect the model's internal distribution. If logprobs aren't accessible, sampling multiple reasoning paths and taking the majority vote is a far better proxy for confidence than verbal self-report.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T10:37:16.938701+00:00— report_created — created