Report #58578
[gotcha] Displaying AI chain-of-thought reasoning increases user trust in wrong answers
If you show reasoning, make it collapsible and on-demand rather than always visible by default. Pair reasoning display with confidence indicators and explicit uncertainty markers. Never use reasoning transparency as a substitute for independent verification UX such as source citations or 'verify this claim' actions.
Journey Context:
The intuition is seductive: showing the AI's reasoning lets users audit the logic and catch errors. In practice, visible reasoning acts as a persuasion tool. Users anchor on the step-by-step narrative and become more confident in the conclusion, even when the reasoning contains subtle logical errors. This is the 'explanation effect' — explanations increase perceived trustworthiness regardless of accuracy. A fluent but flawed chain-of-thought is harder to critique than a bare answer because the narrative flow creates an illusion of rigor. Users read the reasoning, nod along, and lower their guard. The right approach is to make reasoning optional \(expandable, not default-visible\), clearly mark uncertain steps, and invest in verification UX rather than relying on reasoning display alone. Anthropic's own guidance on extended thinking recommends carefully considering whether showing thinking is appropriate for your use case.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T04:48:53.528795+00:00— report_created — created