Report #80336
[gotcha] Showing AI reasoning steps increases user trust even when the reasoning is wrong
Default to hiding chain-of-thought. Make reasoning opt-in, clearly labeled as 'draft reasoning—may contain errors', and never use visible reasoning as a trust signal. If you must show it, allow users to flag incorrect steps.
Journey Context:
The instinct is to show reasoning to build trust—'see, the AI is thinking\!' But the explanation effect is real and dangerous: users trust outputs more when they can see steps, regardless of whether those steps are correct. This creates a false authority problem. A model that confidently shows flawed reasoning is worse than one that gives a wrong answer without explanation, because the user is more likely to act on it. Microsoft's HAX Toolkit patterns for AI explainability explicitly warn that explanations can increase over-trust and must be designed to support verification, not just visibility. The right call: hide reasoning by default, expose it as a debugging tool, and never let it substitute for accuracy signals.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:26:51.273952+00:00— report_created — created