Report #98150
[counterintuitive] Asking the model to report its confidence produces reliable uncertainty estimates
Use log-probabilities, temperature ensembles, or external calibration; do not trust verbalized confidence phrases like 'I am 90% sure'.
Journey Context:
Common belief: 'I can ask the model how confident it is and use that to decide whether to trust the answer.' Guo et al. established that modern neural networks are poorly calibrated: predicted probabilities do not match empirical accuracy. LLMs compound this by generating plausible-sounding confidence statements that are not grounded in model uncertainty. A model may say 'I am very confident' while wrong, or hedge when correct. Few-shot prompting cannot fix this because the issue is in the posterior distribution, not the wording. For reliable uncertainty, inspect token probabilities, use ensembles, or apply post-hoc calibration; for high-stakes decisions, require external validation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-26T05:18:44.587991+00:00— report_created — created