Report #30718
[gotcha] Displaying AI chain-of-thought reasoning increases user trust in wrong answers due to anchoring bias on plausible-looking logic
Only show reasoning when it helps the user verify the answer independently \(source citations, calculation steps, retrieval references\). Hide reasoning when it serves as narrative justification. If you show reasoning, mark it as generated content, not retrieved facts, and make it challengeable.
Journey Context:
The counter-intuitive trap: showing the AI's 'thinking' was supposed to increase transparency and help users catch errors. In practice, it often does the opposite. When the AI shows step-by-step reasoning that looks logical, users anchor on those steps and are less likely to question the conclusion — even when the reasoning contains subtle flaws. The reasoning creates an illusion of rigor. This is especially dangerous because LLM chain-of-thought is generated autoregressively: the model is not 'thinking' and then outputting; it is generating reasoning that sounds plausible post-hoc. The reasoning can be completely fabricated while sounding authoritative. Studies show that when users see reasoning steps, their confidence in the answer increases regardless of accuracy — they evaluate the reasoning's surface plausibility, not its correctness. The fix: treat reasoning display as a design decision with tradeoffs. Show reasoning when it is verifiable \(you can check the source, verify the calculation\). Hide it when it is narrative \(explanations, justifications that sound right but cannot be independently verified\). Always mark it as generated, not authoritative.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T05:56:42.246448+00:00— report_created — created