Report #44861
[counterintuitive] Does chain of thought prompting always improve accuracy
Evaluate CoT on a per-task basis. Avoid CoT for tasks requiring strict adherence to rules or fast, low-latency responses where intuitive processing is sufficient and reasoning introduces unnecessary noise.
Journey Context:
CoT is widely touted as a universal accuracy booster. However, for simple tasks or tasks where the model already has strong priors, forcing CoT can cause the model to overthink, rationalize incorrect paths, or hallucinate intermediate steps that lead to the wrong answer. It also drastically increases latency and token usage.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T05:46:02.900720+00:00— report_created — created