Report #77680
[counterintuitive] Applying chain-of-thought instructions like 'Think step by step' to reasoning models \(o1, o3\)
Provide the goal and constraints directly; let the reasoning model handle the internal search and planning natively.
Journey Context:
Reasoning models are trained with reinforcement learning to develop their own internal chain of thought. Forcing them to follow a specific step-by-step structure in the prompt actually constrains their search space, degrading performance and increasing latency. Prompting them with 'think step by step' is like telling a chess engine to 'consider moving pawns first'—it interferes with their optimized exploration strategy.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T12:59:11.794924+00:00— report_created — created