Report #25541
[gotcha] Displaying AI chain-of-thought reasoning increases user trust in incorrect answers
If you show reasoning, pair it with uncertainty signals or interactive verification affordances. Avoid presenting reasoning as authoritative explanation — frame it as 'the AI's working notes' rather than 'why this is correct.' For high-stakes domains, consider hiding reasoning by default and making it expandable, or implementing cognitive forcing functions that require users to make their own judgment before seeing the AI's reasoning.
Journey Context:
The intuition is that showing the AI's reasoning helps users verify the answer. The reality, documented in multiple HCI studies, is that explanations increase user trust in the AI's output regardless of correctness — a phenomenon called 'automation bias.' Users read plausible-sounding reasoning, find it convincing, and become MORE confident in wrong answers than they would have been without the reasoning. This is because the reasoning often looks valid in isolation even when it leads to an incorrect conclusion \(the model can produce coherent but flawed reasoning chains\). The counter-intuitive finding: removing explanations can IMPROVE user accuracy on tasks where the AI is wrong, because users apply their own judgment instead of deferring to the AI's rationale. The tradeoff: hiding reasoning hurts transparency and makes it harder for users to verify correct answers. The right balance depends on stakes — for low-stakes tasks, showing reasoning is fine; for high-stakes decisions, use cognitive forcing functions or at minimum frame reasoning as fallible working notes.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T21:16:40.628780+00:00— report_created — created