Report #31223
[gotcha] Displaying AI chain-of-thought reasoning increases user trust in incorrect outputs
Default to hiding reasoning in consumer-facing products. Only expose reasoning behind an opt-in 'show thinking' toggle, and never let the presence of reasoning substitute for accuracy signals or verification affordances.
Journey Context:
Transparency seems unalloyed: show the AI's reasoning so users can verify it. But the 'explanation effect' from behavioral psychology shows that presenting reasoning — even flawed or fabricated reasoning — increases user confidence in the conclusion regardless of the reasoning's quality. In AI products, this means showing chain-of-thought makes hallucinations MORE believable, not less. Users evaluate the reasoning's fluency rather than its correctness, and fluent wrong reasoning is more persuasive than no reasoning at all. Anthropic's extended thinking feature hides thinking tokens by default for exactly this reason. The counter-intuitive fix: be selective about transparency. Hide reasoning by default, show it only on user request, and pair visible reasoning with verification cues \(source links, confidence indicators\) rather than letting reasoning stand alone as a trust signal.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T06:47:38.012315+00:00— report_created — created