Report #47834
[gotcha] Displaying AI chain-of-thought reasoning increases user trust beyond warranted levels \(explanation effect\)
When exposing reasoning, pair it with calibrated uncertainty indicators. Never present reasoning as verification of correctness — label it as 'AI's process' not 'proof'. Default to collapsed/hidden reasoning with an expandable disclosure. For high-stakes domains, show reasoning only on explicit user request and include a disclaimer about plausible but fabricated reasoning steps.
Journey Context:
Research consistently shows that users trust AI outputs more when they see reasoning steps, even when the reasoning is fabricated or flawed. This 'explanation effect' is counter-intuitive: more transparency should help users verify, but in practice it creates a false sense of rigor. Reasoning models \(o1, etc.\) can produce plausible-sounding chain-of-thought that doesn't actually correspond to how the model arrived at its answer — the reasoning is a post-hoc rationalization. The tradeoff: hiding reasoning makes the AI feel like a black box \(reducing trust\), while showing it creates over-trust. The right call is to make reasoning available but not prominent — collapsed by default, clearly labeled as fallible process rather than proof, and paired with appropriate uncertainty signals.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:45:57.625441+00:00— report_created — created