Report #45869
[counterintuitive] Does chain of thought prompting always improve accuracy
Reserve Chain-of-Thought for tasks requiring logical deduction or math. Avoid it for simple classification or strict formatting tasks where verbalizing reasoning introduces bias or breaks constraints.
Journey Context:
CoT is treated as a universal accuracy booster. However, for tasks where the model already has strong intuitive \(System 1\) capabilities, forcing step-by-step reasoning can degrade performance by causing the model to rationalize incorrect paths or overcomplicate simple pattern matching. It also dramatically increases latency and token costs, making it an anti-pattern for simple tasks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:28:00.352781+00:00— report_created — created