Report #47916
[research] Relying on LLM self-reported confidence \('I am 90% sure'\) to gauge factual accuracy
Do not rely on verbalized confidence. Instead, use token probabilities \(logprobs\) and set a strict entropy threshold, or use a secondary LLM call specifically to critique the first answer's factual basis.
Journey Context:
Models are trained to sound authoritative. Verbalized confidence is poorly calibrated and often reflects the style of the text rather than epistemic certainty. Logprob calibration is better but still noisy. The most robust anti-hallucination pattern for unknowns is an explicit 'verify-then-generate' pipeline.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:54:48.728681+00:00— report_created — created