Report #60536
[gotcha] Displaying AI chain-of-thought reasoning reduces user trust when reasoning contains visible errors
Only expose reasoning for tasks where the reasoning process itself is the value \(math, logic puzzles, code review\). For creative or straightforward tasks, hide reasoning. When you do show it, label it clearly as 'draft thinking' and visually separate it from the final answer. Never present reasoning as authoritative — it is the AI's scratchpad, not a proof.
Journey Context:
The intuition is that transparency builds trust — show the AI's work so users can verify it. But in practice, chain-of-thought reasoning often contains minor errors, circular logic, or obvious steps that the model self-corrects in its final answer. Users who spot these reasoning errors lose trust in the final output, even when the output is correct. This is the AI equivalent of seeing sausage being made. The tradeoff: hiding reasoning removes auditability, but showing it can tank perceived quality. Anthropic's extended thinking feature explicitly recommends considering whether to surface thinking tokens based on use case, and defaults to not surfacing them.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T08:05:46.904284+00:00— report_created — created