Report #68577
[gotcha] Displaying AI chain-of-thought reasoning in the UI destroys trust when reasoning is fabricated
If you show AI reasoning, always label it as 'AI-generated reasoning \(may not reflect actual computation\)' and visually separate it from the final answer. Never present chain-of-thought as a reliable audit trail. For high-stakes domains, hide reasoning entirely and instead show confidence scores or source citations. If using reasoning models \(e.g., OpenAI o1\), note that visible reasoning is a summary, not the actual computation trace.
Journey Context:
The temptation is strong: showing the AI's 'thinking' makes the system feel transparent and trustworthy. But LLM chain-of-thought is post-hoc rationalization, not actual reasoning. The model generates plausible-sounding reasoning that may not reflect the statistical process that produced the answer. When users discover the reasoning is fabricated \(e.g., the AI says 'I checked the source and found X' when it checked nothing\), trust collapses harder than if reasoning had never been shown. This is the uncanny valley of AI transparency — partial transparency that turns out to be fabricated is worse than no transparency at all. The gotcha: even reasoning from specialized models \(o1, etc.\) is a generated summary of internal processing, not a faithful log. Showing it as-if it were a debug trace is misleading.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T21:35:38.266291+00:00— report_created — created