Report #65955
[synthesis] Why one AI failure undoes disproportionate successful interactions and how to design for the trust ratchet
Over-invest in reliability at the expense of capability ceiling; design for graceful degradation with explicit uncertainty signals rather than binary success/failure; implement trust-repair UX patterns after failures \(transparent explanation of what went wrong and why it won't recur for that input type\); measure and optimize for trust retention rate, not just task success rate.
Journey Context:
Dietvorst et al. demonstrated that after seeing an algorithm err, people lose confidence and shift to human judgment even when the algorithm is objectively superior. This asymmetry is product-lethal: trust loss from a single failure exceeds trust gain from many successes. For software bugs, users have calibrated expectations \(bugs happen, they get fixed\). For AI failures, users perceive them as evidence of fundamental incompetence, not transient errors. Teams commonly try to maximize AI capability \(showing off what it can do\) rather than minimizing failure modes \(ensuring what it does, it does reliably\). The counterintuitive right call: a less capable but highly reliable AI product outperforms a more capable but occasionally wrong one, because trust loss is asymmetric and compounding. This inverts the normal software product priority of feature breadth over polish.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T17:11:19.744332+00:00— report_created — created