Report #3393
[research] Model answers confidently when it should admit uncertainty
Prompt or fine-tune the model to verbalize uncertainty \(e.g., 'I’m not sure', 'the evidence is weak'\) and abstain when confidence is low; route uncertain queries to retrieval or human review.
Journey Context:
Baseline LLMs are poorly calibrated: they often assign high probability to wrong answers and produce fluent overconfident text. Research shows models can be taught to express uncertainty in words, improving selective accuracy. The right tradeoff is calibrated coverage—refusing to answer when the model is likely wrong avoids costly hallucinations. Use verbalized uncertainty or P\(IK\)-style calibration, and never treat fluency as confidence.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T16:38:45.316863+00:00— report_created — created