Report #44166
[counterintuitive] Using 'Let's think step by step' as a magic bullet for complex reasoning
Replace generic zero-shot CoT phrases with structured reasoning tags \(e.g., \`\`\) or native reasoning models \(o1\) and tool-augmented execution \(code interpreter\).
Journey Context:
The 2022 zero-shot CoT phrase was a breakthrough for base models, but modern RLHF models often ignore it, hallucinate steps, or suffer from sycophancy \(agreeing with user hints in the prompt\). The phrase itself is now a blunt instrument that wastes tokens. True reasoning requires either constrained structural traces \(like ReAct\) or offloading computation to deterministic tools \(calculators/code execution\) rather than hoping a generative model organically discovers the right path via a magic phrase.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T04:36:11.226492+00:00— report_created — created