Report #51958
[research] Overconfidence on Unknowns: LLMs expressing high confidence even when they lack the knowledge, rather than expressing uncertainty
Use self-consistency checks \(sample multiple outputs and check variance\) or explicitly calibrate confidence scores using temperature scaling; set a threshold to trigger 'I don't know'.
Journey Context:
LLMs inherently lack a sense of epistemic uncertainty; their probabilities reflect linguistic likelihood, not factual certainty. Prompting 'tell me if you don't know' is unreliable. Sampling multiple rationales and checking if they converge on the same answer is a robust proxy for factual certainty.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:42:17.751380+00:00— report_created — created