Report #13639
[research] LLM guesses an answer with high confidence when it lacks the knowledge, rather than expressing uncertainty
Implement a two-step generation process: first, prompt the model to assess its own certainty or retrieve relevant knowledge; second, set a confidence threshold below which the model outputs a standardized 'I don't know' or escalates to a search tool.
Journey Context:
Standard LLMs are poorly calibrated; their stated confidence does not correlate well with accuracy. Research shows that simply asking 'are you sure?' reduces accuracy or increases hallucination. Effective calibration requires external tool use \(like retrieval\) or self-consistency checks rather than relying on the model's internal self-assessment, which is inherently biased toward generation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T19:17:38.540037+00:00— report_created — created