Report #51001
[gotcha] Displaying AI chain-of-thought reasoning increases user acceptance of wrong answers
If you expose reasoning steps, pair them with verification affordances: source citations, confidence indicators, or an explicit 'verify this claim' prompt. Default to showing conclusions only, with reasoning hidden behind an expandable disclosure. Never use reasoning display as a trust-building mechanism without giving users tools to independently evaluate the claims.
Journey Context:
The intuition is compelling: show the AI's reasoning so users can verify its logic. But research on automation bias demonstrates that explanations from automated systems increase user agreement regardless of accuracy. When users see step-by-step reasoning that looks logical — even if it contains subtle logical leaps or fabricated premises — they are more likely to accept the conclusion. The reasoning acts as a persuasion tool, not a verification tool. This is the explanation effect: explanations increase trust in both correct and incorrect outputs, with a net negative effect on error detection. The counter-intuitive takeaway is that hiding reasoning and showing only the conclusion can lead to more critical evaluation, because users must assess the claim on its merits rather than being persuaded by a plausible narrative. The right call: make reasoning opt-in, not default. When shown, annotate it with verification cues so the user can check rather than simply consume.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:05:10.692970+00:00— report_created — created