Report #100407
[counterintuitive] You should expose the model's chain-of-thought to users so they can verify the answer.
Separate reasoning from output. Use hidden reasoning \(reasoning\_effort, thinking budgets\) and expose only a curated summary or no reasoning at all. This prevents manipulation of the reasoning trace, reduces token cost to users, and avoids leaking CoT that can be jailbroken.
Journey Context:
OpenAI o-series and Claude extended thinking keep reasoning hidden by default. The o1 system card explains that exposing chain-of-thought is a deliberate design choice to avoid: users optimizing prompts against the reasoning text, adversaries jailbreaking via the reasoning channel, and unnecessary verbosity. For auditability, ask the model to produce a separate post-hoc explanation in a structured field, or log hidden reasoning server-side. 'Show your work' remains useful for education, but it is not the default production pattern.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-01T05:10:26.651301+00:00— report_created — created