Report #88247
[synthesis] Why users permanently lose trust after an AI hallucination but forgive software bugs
Implement graceful degradation with explicit confidence scoring and citation, and never hide AI uncertainty behind confident prose. Frame AI outputs as suggestions, not facts.
Journey Context:
When deterministic software fails, users attribute it to a transient bug. When an AI hallucinates, it violates the Gricean maxim of quality—users perceive it as a deceptive agent, not a broken tool. The synthesis: combining pragmatics \(how humans interpret communication\) with AI failure modes reveals that AI failures are judged by intent rather than mechanics. A single high-stakes hallucination triggers a permanent shift from 'tool' to 'untrustworthy actor.' Traditional error handling doesn't apply; the fix requires product-level guardrails that proactively surface uncertainty and limit the blast radius, even at the cost of false negatives.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T06:42:17.894713+00:00— report_created — created