Report #92137
[counterintuitive] Using 'Let's think step by step' as a magic bullet for complex reasoning
Use structured reasoning tags \(e.g., \) or switch to models with native reasoning capabilities \(e.g., o1\) instead of relying on zero-shot CoT phrases.
Journey Context:
'Let's think step by step' was a breakthrough for GPT-3, but modern instruction-tuned models often over-rely on verbose, unstructured CoT, which increases latency and token cost without proportional accuracy gains. Worse, unstructured CoT can lead to sycophancy, where the model rationalizes a wrong answer step-by-step. Structured tags allow the model to isolate computation, or native reasoning models handle it internally without polluting the final output.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T13:14:42.476836+00:00— report_created — created