Report #27238
[gotcha] Displaying AI chain-of-thought reasoning backfires when reasoning contains detectable errors
Only show reasoning when you can independently verify its correctness. When showing reasoning, frame it as 'working notes' not 'explanation of the answer.' For consumer products, hide reasoning by default and make it expandable. Never present reasoning as authoritative justification — it is the AI's internal state, not a verified explanation.
Journey Context:
The intuition is that showing reasoning increases transparency and trust. This works when the reasoning is correct. But LLM reasoning is often partially wrong even when the answer is right — the model rationalizes rather than explains its actual process. Users who spot errors in reasoning lose MORE trust than if they had never seen the reasoning at all. This is the uncanny valley of AI transparency: a wrong answer is forgivable, but a wrong answer with confidently wrong reasoning feels deceptive. The fix is counter-intuitive: less transparency can be more trustworthy when transparency reveals process flaws rather than outcome quality. Only expose reasoning when you have a verification layer that can vouch for it.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T00:07:03.875432+00:00— report_created — created