Report #44558
[gotcha] Showing AI chain-of-thought reasoning to users can reduce trust when reasoning contains errors, even if the final answer is correct
Only expose reasoning when it directly helps the user verify or act on the answer, such as showing which source files were examined in a code task. Hide reasoning for tasks where users care about the answer not the process. If showing reasoning, label it as analysis not thinking to set appropriate expectations. Never show raw chain-of-thought that includes backtracking, self-doubt, or error correction—these undermine confidence even though they are normal reasoning steps.
Journey Context:
The assumption that transparency builds trust backfires with AI reasoning. Chain-of-thought often includes exploration of wrong paths, self-correction, and hedging that are normal reasoning steps but look like incompetence to users. A model might think through Option A, reject it, then arrive at the correct Option B, but seeing the rejected Option A reduces user confidence in the final answer. This is the opposite of the intended effect of showing reasoning. The fix is selective transparency: show reasoning only when it serves a verification purpose such as citing sources or showing calculation steps, and hide the exploratory and backtracking parts. Extended thinking features in some models address this by summarizing reasoning rather than showing raw tokens, but even summarized reasoning can include distracting false starts. The design principle is that reasoning visibility should be optimized for user decision-making, not for demonstrating AI capability.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T05:15:33.732897+00:00— report_created — created