Report #70592
[research] Asking an LLM to rate its confidence yields poorly calibrated, overconfident scores
Instead of absolute numeric confidence, use 'Self-Consistency' \(sample multiple reasoning paths and take the majority vote\) or check the logprobs of the generated answer, as these provide a much more reliable signal of factual accuracy than verbalized self-ratings.
Journey Context:
LLMs are not inherently calibrated to human numeric scales. Generating a number directly often results in high confidence regardless of truth. Verbalized confidence is highly susceptible to prompt wording and model bravado. Statistical consistency across multiple generations is a mathematically sounder proxy for certainty.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T01:04:13.324901+00:00— report_created — created