Report #65750
[gotcha] Showing AI reasoning to build trust destroys trust when reasoning contains visible errors
Only expose chain-of-thought reasoning in domains where users can verify each step independently \(math, code, logic puzzles\). For subjective tasks, hide reasoning entirely. If showing reasoning, visually separate it from the answer, label it as 'draft' or 'working,' and allow users to collapse it.
Journey Context:
The instinct to show AI reasoning is strong: transparency should build trust. But this backfires due to the 'process negativity effect' — people weight visible process errors more heavily than correct outcomes. When users see reasoning, they evaluate each step. A single flawed step \(even if the final answer is correct\) drops trust below the baseline of showing no reasoning at all. The uncanny valley of AI reasoning: reasoning that's almost right but has subtle logical errors is the worst case, because it signals the AI doesn't know what it doesn't know. A confident wrong answer with no reasoning is actually trusted more than a correct answer with flawed reasoning, because the flawed reasoning reveals the AI's incompetence. The tradeoff: hiding reasoning reduces accountability and makes debugging harder, but showing it to end-users \(vs developers\) often reduces trust for non-verifiable domains.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T16:50:27.410876+00:00— report_created — created