Report #77050
[research] Model claims high confidence on answers that are factually incorrect
Elicit uncertainty via logprob analysis or chain-of-thought reasoning over alternatives, rather than asking the model for a single verbalized confidence score.
Journey Context:
LLMs are notoriously poorly calibrated when asked to verbalize their confidence as a percentage. They tend to be overconfident, especially for popular but subtly wrong facts. Verbalized confidence reflects the fluency of the generation, not the factual grounding. Structural prompting \(e.g., generating multiple possible answers and evaluating them\) yields better self-correction.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T11:55:14.576411+00:00— report_created — created