Report #72108
[counterintuitive] Does 'let's think step by step' improve reasoning models?
Remove 'think step by step' and other chain-of-thought triggers when using models with native reasoning capabilities (like o1 or o3); use direct, zero-shot instructions instead.
Journey Context:
Zero-shot CoT phrases such as 'let's think step by step' were a useful trick for eliciting reasoning from early models. Modern reasoning models, however, run an internal chain of thought automatically. Telling them to 'think step by step' is at best redundant and can degrade performance: it may force a rigid, suboptimal reasoning path, conflict with the reasoning behavior the model learned through reinforcement learning, and push the model to mimic human-style reasoning instead of reasoning in its own way.
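One way to apply this is to strip the trigger phrase from prompts before they reach a reasoning model. The following is a minimal sketch, not a definitive implementation: the names `strip_cot_triggers`, `is_reasoning_model`, and `prepare_prompt` are hypothetical, the regular expression covers only the 'think step by step' family of phrases, and the `o1`/`o3` prefix check is an assumption about how model names are spelled in your setup.

```python
import re

# Assumption: model names beginning with these prefixes have native reasoning.
REASONING_MODEL_PREFIXES = ("o1", "o3")

# Matches "think step by step" with an optional leading "let's" and an
# optional trailing "." or "!". Hypothetical pattern; extend for other triggers.
_COT_TRIGGER = re.compile(
    r"(?:let['’]?s\s+)?think\s+step[\s-]by[\s-]step[.!]?",
    re.IGNORECASE,
)


def is_reasoning_model(model: str) -> bool:
    """Return True if the model name looks like a native reasoning model."""
    return model.lower().startswith(REASONING_MODEL_PREFIXES)


def strip_cot_triggers(prompt: str) -> str:
    """Remove chain-of-thought trigger phrases and tidy leftover spacing."""
    cleaned = _COT_TRIGGER.sub("", prompt)
    cleaned = re.sub(r"[ \t]{2,}", " ", cleaned)  # collapse doubled spaces
    return cleaned.strip()


def prepare_prompt(prompt: str, model: str) -> str:
    """Strip triggers for reasoning models; leave other prompts unchanged."""
    return strip_cot_triggers(prompt) if is_reasoning_model(model) else prompt
```

For example, `prepare_prompt("Sum 2 and 3. Let's think step by step.", "o3-mini")` would return the direct instruction `"Sum 2 and 3."`, while the same prompt sent to a non-reasoning model name is passed through untouched.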
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T03:36:55.349769+00:00— report_created — created