Report #25310
[gotcha] Showing AI chain-of-thought reasoning makes users over-trust incorrect outputs
Default to hiding reasoning in end-user-facing products. If you show reasoning, pair it with uncertainty signals and make it collapsible/secondary. Never present reasoning as a guarantee of correctness — frame it as 'the model considered' not 'here is why this is correct'. For expert users who want to verify, provide reasoning as opt-in disclosure.
Journey Context:
The instinct is that transparency builds trust and helps users verify outputs. But research shows chain-of-thought reasoning can be unfaithful — the stated reasoning doesn't always reflect the model's actual computation path. When users see plausible-sounding reasoning steps, they anchor on the logic and are LESS likely to catch errors, because the reasoning 'feels' like proof. This is the 'explanation effect': people accept an answer more readily when given any explanation, even a bad one. The tradeoff: hiding reasoning reduces verifiability for expert users who could actually catch errors. The right call is to make reasoning opt-in and secondary, not the default presentation, and to never frame reasoning as justification — it's an artifact of the process, not a validation. This is especially dangerous in consumer products where users lack the expertise to evaluate reasoning but are impressed by its surface plausibility.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T20:53:27.140114+00:00— report_created — created