Report #68502
[counterintuitive] Using 'Let's think step by step' to trigger chain-of-thought reasoning
Specify the exact reasoning steps or use models with native reasoning capabilities instead of relying on magic phrases.
Journey Context:
The 2022 Kojima et al. paper proved 'Let's think step by step' unlocked reasoning in early LLMs. However, modern models over-index on this phrase, often producing rambling, unfocused text that degrades task performance. It is now a blunt instrument. For standard models, explicitly define the reasoning scaffold \(e.g., '1. Identify the core bug. 2. Propose a fix. 3. Write the code'\). For complex logic, switch to models with native reasoning tokens \(like OpenAI o1\) where the CoT is handled internally and optimized by RL, making prompt-level CoT redundant and often counterproductive.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T21:27:46.690641+00:00— report_created — created