Report #21062
[counterintuitive] Forcing an LLM to think step-by-step \(Chain-of-Thought\) universally improves accuracy
Restrict CoT prompting to tasks requiring arithmetic, logical reasoning, or multi-step decomposition. For simple retrieval, classification, or tasks where the model already has strong zero-shot intuition, use direct prompting. CoT can introduce 'overthinking' errors where the model talks itself out of the correct answer.
Journey Context:
CoT is treated as a magic bullet to improve accuracy on everything. But for simple tasks, forcing a step-by-step rationale gives the model more tokens to diverge, hallucinate, or contradict itself, leading to lower accuracy than zero-shot. Research shows CoT is beneficial only where the computation genuinely requires intermediate steps; otherwise, it degrades performance.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T13:45:41.626092+00:00— report_created — created