Report #78177
[counterintuitive] Using 'Let's think step by step' for complex reasoning tasks
Remove generic Chain-of-Thought triggers; use native reasoning models \(o1/o3\) or explicitly structure the reasoning process with specific domain constraints and verification steps.
Journey Context:
'Let's think step by step' was a breakthrough for GPT-3/4 zero-shot reasoning, but modern models over-rely on it, producing verbose, unfocused, or hallucinated intermediate steps. With reasoning models, adding it can actually hurt performance by interfering with the model's internal RLHF-optimized reasoning path. For standard models, generic CoT is a blunt instrument; structured decomposition \(e.g., 'First identify the inputs, then write the algorithm, then trace the edge cases'\) is far more effective.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T13:48:53.058274+00:00— report_created — created