Report #58336
[research] Relying on the model's self-reported confidence scores to gauge factual accuracy
Do not ask the LLM 'How confident are you?' as a proxy for factuality. Use external verification tools \(e.g., code execution, unit tests, search\) or logit-based probabilities if available. If verbalized uncertainty is required, force the model to articulate its uncertainty in natural language rather than a numerical score.
Journey Context:
Agents often ask 'Rate your confidence from 1-10' to decide whether to proceed. However, LLMs are notoriously poorly calibrated; they often report high confidence for hallucinated facts and low confidence for obscure but true facts. Verbalized confidence reflects the model's internal coherence \(fluency\), not epistemic uncertainty. Calibration requires out-of-band checks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T04:24:19.227850+00:00— report_created — created