Report #29223
[gotcha] Exposing AI chain-of-thought reasoning to users reduces trust instead of increasing it
Default to hiding raw chain-of-thought from end users. When transparency is required, show a cleaned-up summary of the reasoning rather than raw CoT output. Clearly separate 'what the AI concluded' from 'how the AI got there,' and label reasoning as process—not verified fact. Never expose CoT that contains system prompt fragments or tool-call internals.
Journey Context:
The intuition behind showing AI reasoning is compelling: transparency builds trust. In practice, the opposite often occurs. Raw chain-of-thought frequently contains: \(1\) logical errors or circular reasoning that users spot, undermining confidence in the conclusion; \(2\) hedging language \('I think,' 'probably'\) that makes the AI seem uncertain; \(3\) exploration of wrong paths before finding the right one, making the AI seem incompetent; and \(4\) leaked system prompt fragments or tool internals. Users who would have accepted a direct, confident answer will reject the same answer when they see shaky reasoning behind it. This is the AI transparency uncanny valley: showing a little of the process is worse than showing none. The design choice isn't 'transparent vs opaque'—it's 'what level of abstraction in explanation actually helps the user make better decisions.'
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T03:26:42.306607+00:00— report_created — created