Report #99969
[gotcha] Showing the model's raw chain-of-thought can mislead users and leak unreliable reasoning
Hide raw reasoning tokens; surface a curated 'thinking' indicator or a concise summary, and expose debug detail only in a collapsed panel for expert users.
Journey Context:
Reasoning models such as o1 produce long internal chains of thought (CoT). Research shows these chains can be unfaithful: the model's stated reasoning may not match what actually determined the answer, and raw CoT can contain hallucinations. OpenAI deliberately hides the raw chain-of-thought from o1 users. Consumer UX should show a simple 'Thinking...' progress signal or a high-level plan; detailed reasoning belongs behind an expand/collapse control, with a caveat that it may be a post-hoc rationalization rather than a faithful account of how the answer was produced. Dumping the full trace next to the answer erodes trust and trains users to treat speculation as evidence.
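The display policy described above can be sketched as a small view-model builder. This is only an illustration under stated assumptions: the names ModelTurn, TurnView, DebugPanel, build_view, and expert_mode are hypothetical and do not come from any vendor SDK, and the sketch assumes the application already has a curated summary string available separately from the raw trace.

```python
from dataclasses import dataclass
from typing import Optional

# Caveat attached to any reasoning shown to the user (wording is illustrative).
CAVEAT = (
    "Model-generated reasoning. It may not reflect how the answer "
    "was actually produced."
)


@dataclass
class ModelTurn:
    """Hypothetical container for one model response."""
    answer: str
    raw_reasoning: str = ""      # full trace; never rendered by default
    reasoning_summary: str = ""  # short curated summary, if one exists


@dataclass
class DebugPanel:
    collapsed: bool  # panel starts closed; the user must expand it
    caveat: str
    content: str


@dataclass
class TurnView:
    status: str                  # simple progress signal
    answer: str
    summary: Optional[str]       # concise summary, safe for all users
    debug: Optional[DebugPanel]  # present only for expert users


def build_view(turn: ModelTurn, *, done: bool, expert_mode: bool = False) -> TurnView:
    """Decide what the UI may render for a turn.

    Raw reasoning is withheld unless expert_mode is set, and even then it
    is placed in a collapsed panel that carries a caveat.
    """
    debug = None
    if expert_mode and turn.raw_reasoning:
        debug = DebugPanel(collapsed=True, caveat=CAVEAT, content=turn.raw_reasoning)
    return TurnView(
        status="Done" if done else "Thinking...",
        answer=turn.answer if done else "",
        summary=turn.reasoning_summary or None,
        debug=debug,
    )
```

In this sketch a default call such as build_view(turn, done=True) yields a view with no debug panel at all, so a rendering layer cannot show the raw trace by accident; the trace only becomes reachable when the caller opts in with expert_mode=True.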
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-30T05:22:15.012049+00:00— report_created — created