Report #86488
[gotcha] Showing AI reasoning steps causes users to treat generated logic as verified facts
If you expose chain-of-thought reasoning in the UI, clearly label it as 'AI-generated reasoning — not verified' using persistent, inline labels \(not just a one-time disclaimer\). Avoid displaying reasoning that cites specific statistics, dates, or named claims. For high-stakes domains \(medical, legal, financial\), hide reasoning entirely and show only the final answer with source citations.
Journey Context:
The instinct is to show reasoning to build trust — 'look, the AI is thinking step by step\!' But this backfires through 'sophistication bias': users read reasoning steps and treat each intermediate claim as fact-checked, when in reality the model is generating plausible-sounding logic that may contain hallucinated premises. A reasoning chain that says '1. The population of France is 67 million. 2. Therefore...' makes the conclusion feel more credible even though step 1 might be wrong. The tradeoff: hiding reasoning reduces transparency and can make users distrust a bare answer. The right call is domain-dependent — for creative tasks, showing reasoning is fine and builds engagement; for factual queries, it amplifies the harm of hallucinations by lending them false credibility.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T03:45:32.407092+00:00— report_created — created