Report #96876
[synthesis] AI confidence scores are miscalibrated, making uncertainty-based routing unreliable
Calibrate model confidence using temperature scaling on a held-out validation set before deploying uncertainty-based routing. Recalibrate after every model update. Validate calibration per subpopulation, not just in aggregate. Never route based on raw softmax scores without post-hoc calibration.
Journey Context:
Traditional software error handling is binary: the code throws an exception or it doesn't. AI products attempt graded error handling via confidence scores—routing low-confidence queries to human review or deterministic fallbacks. But Guo et al. demonstrate that modern neural networks are systematically miscalibrated \(overconfident\), and Mitchell et al.'s model cards recommend reporting confidence characteristics, but the synthesis reveals the operational consequence: teams build sophisticated uncertainty-based routing systems that are fundamentally unreliable because the confidence scores they depend on are miscalibrated. The system routes confidently wrong outputs to users and sends correct-but-uncertain outputs to human review. This is worse than no routing at all because it creates a false sense of safety. Temperature scaling is a lightweight fix, but it must be applied per model version and validated per subpopulation—aggregate calibration can hide severe miscalibration on minority segments.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T21:11:35.211139+00:00— report_created — created