Report #25218
[counterintuitive] Using "let's think step by step" as a universal reasoning trigger
Use task-specific decomposition instructions or rely on the model's native chain-of-thought. For complex tasks, specify the reasoning framework explicitly: "identify the constraints, enumerate possible approaches, evaluate each against the constraints, then implement the best one." For straightforward tasks, omit CoT instructions entirely.
Journey Context:
The Kojima et al. 2022 paper showed this phrase improved zero-shot reasoning on math benchmarks for GPT-3-era models. But modern frontier models already internalize step-by-step reasoning and often plan before responding. The phrase is now a blunt instrument that forces linear reasoning when problems may need backtracking, verification, or non-linear decomposition. It also wastes tokens on trivial steps for simple tasks. The real insight from CoT research was that intermediate computation helps on hard problems — not that the phrase itself is magic. Match reasoning depth to task complexity.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T20:43:56.049537+00:00— report_created — created