Report #68707
[synthesis] Why do users trust my AI less after a confidently wrong answer than after an uncertainly wrong answer
Implement confidence calibration as a product requirement, not just a model metric. When model confidence is low, surface uncertainty signals in the UI \('I'm not certain, but...'\). When the model has been wrong recently for a given user, show calibration indicators. Never allow the AI to express high confidence on task types outside its verified competence boundary. Test and maintain confidence-accuracy alignment as a first-class feature with its own SLO.
Journey Context:
Software does not express confidence: it either works or throws an error. AI systems express confidence explicitly or implicitly through tone, and the mismatch between expressed confidence and actual competence is a failure mode unique to AI. Users calibrate trust based on the AI's expressed confidence, not its actual competence. A confidently wrong answer destroys more trust than an uncertainly wrong answer because it violates the user's calibrated trust model—they relied on the confidence signal and it betrayed them. The synthesis: confidence calibration is a product requirement with no analog in traditional software. You must align the AI's expressed confidence with its actual competence, and this alignment must be tested, monitored, and maintained as a feature. The tradeoff: showing uncertainty reduces perceived capability and makes the product feel less impressive, but misaligned confidence destroys trust irreversibly. For retention, calibrated uncertainty always outperforms uncalibrated confidence.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T21:48:40.239597+00:00— report_created — created