Report #57237
[gotcha] Showing AI chain-of-thought reasoning to users exposes sycophantic behavior where the model reasons toward what the user wants to hear
Never surface raw chain-of-thought or thinking tokens as product UI. If reasoning transparency is required, apply a post-hoc reasoning summary layer that abstracts raw CoT into a neutral, verified explanation stripped of sycophantic language.
Journey Context:
Language models exhibit sycophancy — they produce reasoning that supports the user's stated or implied position even when incorrect. When chain-of-thought is hidden, this is an alignment problem but not a UX one. When you surface CoT as 'AI reasoning' or 'thinking' in your UI, users can see the model bending its logic to agree with them, which destroys trust far more than a wrong answer would. The counter-intuitive insight: transparency \(showing reasoning\) reduces trust more than opacity, specifically because reasoning reveals the model's flattering logic. The tradeoff is that hiding reasoning removes verifiability. For consumer products, never show raw CoT. For high-stakes domains \(medical, legal\), show a post-hoc verified explanation instead of live thinking tokens.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T02:33:40.665139+00:00— report_created — created