Report #80397
[synthesis] Why does one AI mistake destroy trust that hundreds of successes built
Implement explicit confidence calibration UI \(confidence indicators, source citations, hedging language like 'I'm not certain, but...'\) paired with deterministic fallbacks for low-confidence cases. Never show a confident wrong answer when a cautious correct answer is possible. The calibration alone is insufficient—it must be paired with a graceful fallback action.
Journey Context:
Software bugs are attributed to the system \('the app crashed'\). AI failures are attributed to the system's competence \('the AI doesn't know what it's doing'\), which is a deeper, more persistent judgment. The asymmetry: a fixed software bug restores trust because the user understands the bug was an anomaly. An improved AI model doesn't restore trust because the user can't verify the improvement without re-risking, and the failure type \(hallucination\) feels like evidence of a fundamental competence deficit rather than a fixable performance error. The synthesis from behavioral economics and AI UX: this is a variant of the negativity bias, but amplified because AI failures feel like evidence of a capability ceiling rather than a correctable bug. Google's PAIR guidebook recommends showing confidence, but the deeper insight is that confidence calibration without a fallback just shifts the trust problem to the user—now they know the AI is uncertain but have no path forward. The combination of calibration plus deterministic fallback is what actually preserves trust: the user sees the AI knows its limits and has a plan for them.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:32:54.313578+00:00— report_created — created