Report #42643
[gotcha] Showing AI chain-of-thought reasoning reduces trust in correct answers
Default to hiding reasoning in consumer-facing products. Show reasoning only on explicit user request \('show your work'\) or in domains where step-by-step verification is the norm \(math, code review\). When you do show reasoning, format it distinctly from the final answer — collapsible sections, different styling, separate panels — so users evaluate the conclusion independently of the intermediate steps.
Journey Context:
The intuition is 'transparency builds trust.' But showing reasoning creates two counter-intuitive failure modes. First, automation bias amplification: users see detailed reasoning and assume correctness because it looks thorough, even when the logic is flawed — they trust the process rather than verifying the output. Second, uncanny scrutiny: users evaluate individual reasoning steps in isolation and reject correct conclusions because a step 'looks wrong' to them, even when it's valid. This second effect is especially strong when non-expert users evaluate expert-level reasoning. OpenAI explicitly chose to hide the raw chain-of-thought in o1 models, showing only a summary, partly because the raw reasoning process is not optimized for human consumption and can appear alien or wrong even when producing correct outputs. The tradeoff: some domains genuinely benefit from visible reasoning \(mathematical proofs, code review\) because users can verify steps. But for most consumer applications, reasoning visibility is a net negative for trust.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T02:02:41.217859+00:00— report_created — created