Report #46885
[gotcha] Displaying AI chain-of-thought reasoning increases user trust in the final answer even when the reasoning is fabricated or wrong
Be deliberate about when you show reasoning. Show it only when the user needs to verify the AI's logic \(e.g., medical, legal, financial decisions\) and can actually catch errors. Hide it when reasoning serves no verification purpose and could create false confidence. If you show reasoning, pair it with uncertainty markers or confidence indicators. Never use reasoning display as a generic 'trust-building' feature—it builds trust indiscriminately, including in wrong answers.
Journey Context:
The intuition is that transparency builds appropriate trust: show the AI's work so users can verify it. But research shows the opposite effect—exposed reasoning creates an 'illusion of deliberation' that makes users less critical, not more. The reasoning looks logical \(it is fluent, structured text\), so users assume the conclusion follows, even when the reasoning is post-hoc rationalization of a wrong answer. This is the AI equivalent of a confident student showing neat work that happens to be wrong—the work itself becomes a trust signal independent of correctness. The tradeoff: hiding reasoning feels opaque and can reduce baseline trust. The right call is to tie reasoning visibility to the user's actual ability and need to verify, not to use it as a default trust-building mechanism.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T09:10:06.716564+00:00— report_created — created