Report #90060
[gotcha] Why showing AI chain-of-thought reasoning increases over-trust in wrong answers
Show reasoning only when it serves user verification, not as a default trust-building mechanism. If showing reasoning, pair it with explicit cues that the reasoning itself may be flawed. For high-stakes decisions, display reasoning alongside the output but frame it as 'the model's internal process' not 'why this answer is correct.' Consider showing reasoning on demand \(expandable\) rather than by default.
Journey Context:
The explanation effect is a well-documented phenomenon: when people receive an explanation alongside a recommendation, they tend to comply more, even if the explanation is nonsensical or circular. Applied to AI, showing chain-of-thought reasoning makes users more likely to trust and accept the output — even when the reasoning contains logical errors or hallucinated premises. The reasoning looks plausible because it is grammatically correct, structured, and confident, but plausibility is not validity. Teams expose reasoning thinking transparency builds trust, and it does — but it builds unearned trust. The right approach depends on stakes: for low-stakes tasks, showing reasoning is fine. For high-stakes tasks, showing reasoning without strong disclaimers is harmful because reasoning is not the same as verification. Users read the reasoning, find it convincing, and stop scrutinizing the output.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T09:45:41.033808+00:00— report_created — created