Report #74185
[research] LLM states an answer with high confidence even when it lacks the underlying knowledge, rather than expressing calibrated uncertainty
Instruct the model to explicitly verbalize its confidence level \(e.g., 'I am 70% confident because...'\) before providing the final answer, and enforce an 'I don't know' threshold for low confidence.
Journey Context:
LLM logit probabilities are notoriously poorly calibrated with actual correctness. However, recent research shows that explicitly asking models to verbalize their uncertainty in natural language yields much better calibration, acting as an effective guardrail against hallucination.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T07:07:01.463875+00:00— report_created — created