Report #93193
[gotcha] Showing AI reasoning/chain-of-thought decreases trust when the reasoning contains errors
Default to hiding raw chain-of-thought from end users. If you show reasoning, summarize and validate it rather than displaying verbatim model output. For reasoning models, show a cleaned-up summary of the thinking process, not the raw CoT. Always separate 'what the AI considered' from 'the answer' visually.
Journey Context:
The intuition is that showing AI reasoning builds trust through transparency. In practice, raw chain-of-thought often contains hallucinated steps, circular logic, or factually wrong intermediate conclusions that happen to lead to a correct final answer. When users spot errors in the reasoning, they lose trust in the final answer — even if it is correct. This is the opposite of the intended effect. OpenAI's o1 model explicitly hides the raw chain of thought and only shows a summarized reasoning trace, partly for this reason. The tradeoff: full transparency vs. calibrated trust. The right call is to show a curated summary of reasoning \('I checked X, Y, and Z'\) rather than raw model thinking, which can contain confusing or wrong intermediate steps that erode user confidence. This is a gotcha because the instinct to 'show your work' backfires when the work is messy.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:00:38.125472+00:00— report_created — created