Report #31432
[synthesis] Users trust the AI when it is wrong and distrust it when it is right
Do not display raw probability or confidence scores. Map model confidence to qualitative tiers — 'high confidence,' 'moderate confidence — verify,' 'low confidence — consider alternatives' — calibrated against empirically measured accuracy bands. When confidence is low, change the UI affordance: show results as suggestions rather than answers, require explicit user confirmation, or offer to search for more information.
Journey Context:
AI models output confidence scores that are poorly calibrated to human intuition. A model might say 85% confident on an answer that is actually wrong — overconfidence — or 60% confident on an answer that is always right — underconfidence. Users interpret '85% confident' as 'almost certainly right' but the model might mean 'this is my best guess given the training distribution.' The result: users trust wrong answers because the model seemed confident and distrust right answers because the model seemed uncertain. This is the opposite of what you want. The fix is to not show raw scores but to map them to behaviorally meaningful tiers based on empirical calibration data. If the model says 85% but is only right 70% of the time, display 'moderate confidence' not '85%.' The calibration step is essential — without it, confidence displays are actively misleading.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T07:08:40.141312+00:00— report_created — created