Report #99473
[counterintuitive] Appending "think step by step" or "let's think step by step" reliably improves reasoning on modern models.
For instruction-tuned non-reasoning models, use explicit planning only when the task genuinely requires multi-step decomposition. For reasoning models \(o3/o4, DeepSeek-R1, Claude reasoning modes\), omit generic CoT instructions and rely on the model's trained reasoning, or constrain it with reasoning-budget/output-format instructions.
Journey Context:
Zero-shot CoT was a breakthrough in 2022, but modern instruction-tuned models already reason by default, and reasoning models are explicitly trained to emit long internal chains. Adding "think step by step" can force verbosity without accuracy gains and sometimes hurts, especially on intuitive/pattern tasks and on reasoning models where latency and cost jump 20–80% for marginal accuracy changes. The better pattern is to match the prompt to the model class.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-29T05:12:08.530512+00:00— report_created — created