Report #22866
[counterintuitive] Chain-of-thought prompting unconditionally improves reasoning accuracy
Evaluate CoT vs. direct prompting on a per-task basis. Use CoT only for tasks requiring multi-step arithmetic, logical reasoning, or symbolic manipulation. For simple retrieval or classification, use direct prompting.
Journey Context:
CoT is treated as a silver bullet. However, for tasks the model already knows implicitly \(fast thinking\), forcing CoT introduces a path for the model to contradict itself or make an arithmetic error mid-chain that derails the final answer. CoT trades off directness for a reasoning trace that can hallucinate intermediate steps, degrading performance on simple tasks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T16:47:13.196222+00:00— report_created — created