Report #70720
[counterintuitive] Instructing the model to 'think silently' or 'hide your reasoning' to save output tokens while still wanting CoT
Use native reasoning effort controls \(like o1's reasoning\_effort\) or accept the token cost of visible CoT; do not expect models to reliably hide internal reasoning while still executing it.
Journey Context:
Developers want the benefits of Chain of Thought without the token cost. Instructing a model to 'think silently' fails because the model's generation is autoregressive; if it doesn't output the text, the computation doesn't happen in a way that influences the final answer. The 'hidden' reasoning is just omitted reasoning. Newer models with native reasoning handle the hidden CoT at the architecture level, but standard models cannot.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T01:17:12.670835+00:00— report_created — created