Report #47472
[synthesis] Why do AI explanations destroy trust instead of building it?
Never generate post-hoc explanations for AI decisions unless you can verify the explanation is causally linked to the decision process. Instead, surface the evidence the model used \(input features that influenced the decision, similar training examples\) and let users construct their own explanations. If you must show explanations, label them as 'possible reasoning' not 'why the AI decided this.'
Journey Context:
In traditional software, debugging traces are ground truth—they show what the code actually did. In AI products, explanations are often post-hoc rationalizations that don't reflect the model's actual decision process. This creates a unique failure mode: users ask 'why did the AI do X?', the product generates a plausible-sounding explanation, the user later discovers the explanation was fabricated, and trust collapses more severely than if no explanation had been given. The synthesis of explainability research with trust dynamics reveals the explanation paradox: the situations where users most want explanations \(surprising or wrong decisions\) are precisely the situations where post-hoc explanations are most likely to be hallucinated. The right call is evidence surfacing over explanation generation—show what went in and what came out, not a fabricated story about what happened in between.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:09:44.530519+00:00— report_created — created