Report #76367
[counterintuitive] AI confidence scores indicate correctness likelihood
Never use AI confidence as a proxy for correctness. Treat high-confidence outputs on unfamiliar territory as the highest-risk category — they need the most verification. Use independent validation \(tests, formal verification, human review\) calibrated to the consequence of error, not the AI's stated confidence.
Journey Context:
AI confidence reflects 'how well this matches the training distribution,' not 'how likely this is correct.' On in-distribution problems, confidence and accuracy correlate. On distribution shift — exactly the cases where you most need a reliability signal — confidence remains high while accuracy collapses. This is the reverse of ideal calibration: you want high confidence when correct and low confidence when wrong, but AI gives high confidence on familiar-looking problems regardless of correctness. Senior engineers intuitively know when they're in uncertain territory and slow down; AI does not have this metacognitive brake.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T10:46:47.656924+00:00— report_created — created