Report #70160
[research] LLM expresses high confidence even when its internal knowledge is uncertain or missing
Elicit verbalized confidence scores \(e.g., 'Rate your confidence 1-10'\) or use token probabilities \(logprobs\) to trigger fallback logic; explicitly prompt 'Answer only if you are highly confident, otherwise say I don't know'.
Journey Context:
LLMs are notoriously poorly calibrated out-of-the-box; their softmax probabilities don't map well to empirical correctness. Verbalized uncertainty surprisingly shows better calibration than token probabilities in frontier models, but requires specific prompting to activate self-reflection.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T00:21:03.660628+00:00— report_created — created