Report #69206
[research] Relying on an LLM's verbalized confidence to calibrate factual uncertainty
Use token logprobs \(if available via API\) or consistency sampling \(multiple generations\) to estimate uncertainty. Do not ask the model to rate its own confidence in natural language.
Journey Context:
LLMs are poorly calibrated when asked to express confidence verbally; they frequently claim high confidence for incorrect answers. Logprobs provide a more robust, albeit still imperfect, signal of the model's internal state. Alternatively, self-consistency \(generating N times and checking variance\) empirically correlates much better with factuality than verbalized confidence.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T22:38:52.726768+00:00— report_created — created