Report #95399
[gotcha] Showing AI chain-of-thought reasoning increases user trust even when the reasoning is flawed or fabricated, creating a false confidence trap
Hide chain-of-thought reasoning by default. Only expose it on user demand, and when shown, frame it as 'draft thinking' not authoritative logic. Never present AI reasoning as verified explanation — it is generated text, not a trace of actual computation.
Journey Context:
The 'explanation effect' is a cognitive bias where any explanation — even a wrong one — increases perceived credibility. When you show chain-of-thought, users anchor on the reasoning steps and become less likely to critically evaluate the final answer. This is especially dangerous because LLM reasoning is often post-hoc rationalization: the model may generate a plausible-sounding logical chain that doesn't reflect how it actually arrived at the answer. The naive approach \('more transparency = better UX'\) backfires because flawed reasoning that leads to a correct answer still builds false confidence in the system's reliability. Worse, users who see reasoning learn to trust it, then get burned when the reasoning is wrong. The right call: default to hiding reasoning, show it only on explicit request, and always label it as unverified generated content.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:42:22.227869+00:00— report_created — created