Report #22358
[counterintuitive] Asking the model to 'think inside scratchpad tags' or 'reason silently then give only the final answer' to get hidden reasoning
For models with native reasoning capabilities (o1, o3, DeepSeek-R1), rely on their built-in thinking and do not add scratchpad instructions. For models without native reasoning, keep chain-of-thought visible so you can verify the reasoning path; do not try to hide it.
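A minimal sketch of that rule as prompt-building logic. The prefix list, the `has_native_reasoning` and `build_prompt` helpers, and the wording of the visible-reasoning suffix are all hypothetical names chosen for illustration, not part of any vendor API; matching on model-name prefixes is an assumption that will need adjusting for your own model catalog.

```python
# Hypothetical prefixes for models assumed to have native reasoning.
# This list is illustrative only; maintain your own.
NATIVE_REASONING_PREFIXES = ("o1", "o3", "deepseek-r1")

# Suffix that asks for reasoning in the open, so it can be audited.
VISIBLE_COT_SUFFIX = (
    "Work through the problem step by step, showing each step, "
    "then state the final answer on its own line prefixed with 'Answer:'."
)


def has_native_reasoning(model_name: str) -> bool:
    """Return True if the model name starts with a known reasoning-model prefix."""
    return model_name.lower().startswith(NATIVE_REASONING_PREFIXES)


def build_prompt(task: str, model_name: str) -> str:
    """Build a prompt without any scratchpad or 'think silently' instruction.

    Reasoning models get the bare task; other models get a request for
    visible step-by-step reasoning.
    """
    if has_native_reasoning(model_name):
        return task
    return f"{task}\n\n{VISIBLE_COT_SUFFIX}"
```

For example, `build_prompt("What is 17 * 23?", "o3-mini")` returns the task unchanged, while the same call with a model name outside the prefix list appends the visible-reasoning suffix.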
Journey Context:
Before reasoning models existed, practitioners tried to get 'hidden reasoning' by asking models to think inside XML tags or scratchpad blocks. This was unreliable for three reasons: (1) models would skip the thinking and jump straight to answers, (2) the 'hidden' reasoning would leak into the final output, and (3) you could not verify whether the model actually reasoned or just performed reasoning theater. Modern reasoning models (o1, o3, DeepSeek-R1) produce a chain-of-thought that is trained into the model and kept separate from the final answer, and the model actually uses it to search and plan. Adding scratchpad instructions to these models can interfere with their native reasoning process. For non-reasoning models, visible chain-of-thought is strictly preferable because it lets you audit the reasoning. If you cannot see the reasoning, you cannot trust it.
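The first two failure modes can be checked mechanically on legacy scratchpad-style outputs. The sketch below assumes the prompt asked for reasoning inside `<scratchpad>` tags; the tag name, the `audit_scratchpad_output` helper, and both length thresholds are hypothetical choices for illustration.

```python
import re

# Assumed tag name; adjust to whatever the legacy prompt requested.
SCRATCHPAD_RE = re.compile(r"<scratchpad>(.*?)</scratchpad>", re.DOTALL)


def audit_scratchpad_output(raw: str, min_reasoning_chars: int = 40) -> list[str]:
    """Return a list of detected problems; an empty list means none found."""
    match = SCRATCHPAD_RE.search(raw)
    if match is None:
        # Failure mode (1): the model jumped straight to an answer.
        return ["skipped: no scratchpad block in the output"]

    problems = []
    reasoning = match.group(1).strip()
    if len(reasoning) < min_reasoning_chars:
        # Also failure mode (1): the block exists but is effectively empty.
        problems.append("skipped: scratchpad block is near-empty")

    # Everything outside the scratchpad blocks is the user-facing answer.
    final = SCRATCHPAD_RE.sub("", raw).strip()

    # Failure mode (2): stray tags or reasoning lines repeated in the answer.
    if "<scratchpad" in final or "</scratchpad" in final:
        problems.append("leak: unbalanced scratchpad tag in final output")
    for line in reasoning.splitlines():
        line = line.strip()
        if len(line) >= 20 and line in final:
            problems.append("leak: reasoning text repeated in final output")
            break
    return problems
```

A check like this cannot detect the third failure mode: a well-formed, non-leaking scratchpad says nothing about whether the model relied on it, which is the reason the report favors visible reasoning that a person can read and audit.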
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T15:56:09.146208+00:00— report_created — created