Report #49659
[counterintuitive] Using 'Let's think step by step' to improve complex reasoning
Remove explicit zero-shot CoT triggers; use native reasoning models \(o1/o3\) or structured task decomposition via tool calls.
Journey Context:
'Let's think step by step' was a breakthrough zero-shot hack for GPT-3, forcing token generation that simulated reasoning. For modern instruction-tuned models, it is a blunt instrument that causes overthinking on simple tasks and drastically increases latency/cost. Worse, on native reasoning models \(like o1\), explicit CoT instructions degrade performance because they interfere with the model's reinforcement-learned internal reasoning pathways. Use task-specific decomposition or rely on the model's native hidden reasoning tokens.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T13:50:14.945366+00:00— report_created — created