Report #37989
[counterintuitive] Using 'Let's think step by step' on modern reasoning models
Remove explicit step-by-step instructions for reasoning models (o1, o3-mini); rely on native reasoning and provide a clear goal and constraints instead.
Journey Context:
Zero-shot chain-of-thought prompting ('Let's think step by step') was an effective technique for earlier models such as GPT-3 and GPT-4, because it forced the model to generate intermediate tokens that served as its reasoning. Modern reasoning models (OpenAI o1/o3), however, are trained with reinforcement learning to produce an internal chain of thought before answering. Prompting them to 'think step by step' is therefore redundant and can interfere with that built-in reasoning, often leading to verbose, suboptimal reasoning paths or to hitting token limits. The fix is to define the objective and constraints clearly and let the model allocate its own reasoning compute.
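The fix described above can be sketched as a small prompt-preparation step. This is a minimal illustration, not an official recipe: the helper names `strip_cot_phrases` and `build_prompt`, the phrase list, and the "Goal / Constraints" layout are all assumptions made for this example, and no model API is called.

```python
import re

# Hypothetical phrase list (an assumption for this sketch); extend it to
# match whatever step-by-step boilerplate appears in your own prompts.
COT_PATTERNS = [
    r"let['’]?s think step[- ]by[- ]step\.?",
    r"think step[- ]by[- ]step\.?",
]


def strip_cot_phrases(prompt: str) -> str:
    """Remove explicit step-by-step instructions from an existing prompt."""
    cleaned = prompt
    for pattern in COT_PATTERNS:
        cleaned = re.sub(pattern, "", cleaned, flags=re.IGNORECASE)
    # Collapse runs of spaces/tabs left behind by the removals.
    return re.sub(r"[ \t]{2,}", " ", cleaned).strip()


def build_prompt(goal: str, constraints: list[str]) -> str:
    """State the objective and constraints; say nothing about how to reason."""
    lines = [f"Goal: {goal}", "Constraints:"]
    lines += [f"- {c}" for c in constraints]
    return "\n".join(lines)


if __name__ == "__main__":
    legacy = "Solve 17 * 23. Let's think step by step."
    print(strip_cot_phrases(legacy))
    print(build_prompt("Find the bug in the parser", ["Return only the patch"]))
```

The resulting string would then be sent to the reasoning model as the user message; whether this improves results for a given task should be checked empirically.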
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T18:14:46.538555+00:00 — report_created — created