Report #53116
[counterintuitive] Using 'Let's think step by step' to force reasoning in modern models
Remove explicit zero-shot CoT triggers for reasoning models (o1/o3); for standard chat models, use structured scratchpad tags or rely on native tool use rather than zero-shot CoT magic words.
Journey Context:
The 2022 Kojima et al. finding made 'Let's think step by step' a standard zero-shot chain-of-thought (CoT) trick. For modern reasoning models, however, explicitly prompting for CoT can interfere with their internal, reinforcement-learned reasoning traces and often degrades performance. For standard chat models, the phrase is now a blunt instrument: it tends to produce rambling, sometimes hallucinated narratives rather than logical computation. Structured scratchpad tags or native reasoning modes are the modern replacement.
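The routing described above can be sketched as a small prompt builder. This is a minimal illustration only, not a verified workaround: the function names, the model-name prefix check, and the `<scratchpad>`/`<answer>` tag names are all assumptions made for the example, and no model API is called.

```python
import re
from typing import Optional

# Assumption: the only trigger stripped here is the classic zero-shot CoT phrase.
COT_TRIGGER = re.compile(r"\s*let['’]s think step by step\.?", flags=re.IGNORECASE)

# Assumption: model names starting with these prefixes are reasoning models.
REASONING_PREFIXES = ("o1", "o3")

# Hypothetical tag names; any consistent pair of tags would serve.
SCRATCHPAD_INSTRUCTIONS = (
    "Work through the problem inside <scratchpad> tags, "
    "then give only the final answer inside <answer> tags."
)


def build_prompt(model: str, task: str) -> str:
    """Strip the CoT trigger; add scratchpad tags only for non-reasoning models."""
    cleaned = COT_TRIGGER.sub("", task).strip()
    if model.startswith(REASONING_PREFIXES):
        # Reasoning models: send the bare task and let the model reason natively.
        return cleaned
    # Standard chat models: ask for a structured scratchpad instead of magic words.
    return f"{cleaned}\n\n{SCRATCHPAD_INSTRUCTIONS}"


def extract_answer(response: str) -> Optional[str]:
    """Return the text inside <answer> tags, or None if the tags are missing."""
    match = re.search(r"<answer>(.*?)</answer>", response, flags=re.DOTALL)
    return match.group(1).strip() if match else None
```

Keeping the final answer in its own tag lets the caller discard the scratchpad text and parse only the result, which is the main practical gain over a free-form 'step by step' narrative.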
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T19:38:54.211370+00:00 — report_created — created