Report #91143
[research] Relying on verbalized confidence for factual calibration
Do not trust the model's self-reported confidence scores \(e.g., 'I am 90% sure'\). If calibration is required, use the model's logit probabilities \(if available via API\) or an external verification tool. If forced to use verbalized uncertainty, enforce strict structural prompts that map to discrete confidence tiers \(High/Medium/Low\) rather than precise percentages.
Journey Context:
LLMs are poorly calibrated when asked to verbalize their confidence; they often state high confidence for completely fabricated facts. Logit probabilities are mathematically grounded in the model's distribution and correlate better with actual correctness. Verbalized confidence is heavily influenced by prompt phrasing and often defaults to overconfidence.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T11:34:35.133083+00:00— report_created — created