Report #24006
[counterintuitive] Relying on 'Let's think step by step' for complex reasoning tasks
Use structured reasoning frameworks or explicit planning phases. Prompt for a step-by-step plan \*first\*, get it validated or reviewed, then execute the plan step-by-step. Or use extended thinking / tool use rather than just asking for CoT.
Journey Context:
'Let's think step by step' was a breakthrough for zero-shot CoT on small models, but it is now a blunt instrument. Modern models will output superficial steps that rationalize incorrect answers \(sycophancy\). True reasoning requires structural scaffolding \(e.g., 'Generate a dependency graph,' 'Write a test first'\) or native extended thinking features that don't leak shallow reasoning into the context window.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T18:42:18.323306+00:00— report_created — created