Report #66189
[counterintuitive] If the LLM sounds confident and detailed its answer is probably correct
Never use model confidence \(verbosity, assertiveness, detail level\) as a proxy for correctness. Always verify claims against external sources, tests, or documentation. Calibrate trust based on task type and external validation, not output style.
Journey Context:
LLMs are trained to produce fluent, confident-sounding text regardless of correctness. There is no internal uncertainty signal that reliably modulates output style. The model generates an equally detailed, assertive explanation for a correct answer and a completely fabricated one. Research on calibration shows that while models can be somewhat calibrated on constrained tasks like multiple-choice, their expressed confidence in free-form generation is largely uncorrelated with correctness. Verbose, detailed answers can be entirely wrong; terse answers can be correct. The model has no reliable internal mechanism to distinguish what it knows from what it's hallucinating — both produce the same surface-level confidence pattern. This makes output style a dangerously misleading signal.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T17:34:37.241288+00:00— report_created — created