Report #78011
[counterintuitive] chain of thought prompting always improves accuracy
Evaluate CoT on a per-task basis; avoid CoT for tasks requiring fast, intuitive, or strictly memorized recall where verbalization introduces post-hoc rationalization or overthinking.
Journey Context:
CoT is often treated as a universal accuracy booster. However, for tasks that rely on implicit (System 1) knowledge, forcing a step-by-step explanation can lead the model to rationalize away a correct initial answer, producing worse results. CoT also increases latency and token usage, because the model must generate its reasoning before the answer, and it can make models more susceptible to distraction by irrelevant context in the prompt.
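One way to act on the per-task advice is to score direct and CoT prompting side by side on a small labeled set, grouped by task type, and enable CoT only where it measurably helps. The sketch below is illustrative only: `ask` is a hypothetical callable standing in for whatever model client is in use, and the `(task_type, question, expected)` tuple shape, exact-match scoring, and the `min_gain` threshold are assumptions, not part of the report.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, Tuple

# Assumed example shape: (task_type, question, expected_answer).
Example = Tuple[str, str, str]
# Hypothetical model interface: ask(question, use_cot) -> final answer string.
AskFn = Callable[[str, bool], str]


def accuracy_by_task(examples: Iterable[Example], ask: AskFn) -> Dict[str, Dict[str, float]]:
    """Return {task_type: {"direct": accuracy, "cot": accuracy}} using exact match."""
    correct = defaultdict(lambda: {"direct": 0, "cot": 0})
    totals = defaultdict(int)
    for task_type, question, expected in examples:
        totals[task_type] += 1
        for mode, use_cot in (("direct", False), ("cot", True)):
            if ask(question, use_cot).strip() == expected:
                correct[task_type][mode] += 1
    return {
        task: {mode: correct[task][mode] / totals[task] for mode in ("direct", "cot")}
        for task in totals
    }


def choose_mode(scores: Dict[str, Dict[str, float]], min_gain: float = 0.02) -> Dict[str, str]:
    """Pick CoT for a task type only if it beats direct prompting by at least min_gain."""
    return {
        task: "cot" if s["cot"] - s["direct"] >= min_gain else "direct"
        for task, s in scores.items()
    }
```

Defaulting to direct prompting unless CoT clears a threshold reflects the latency and token cost noted above; the threshold value itself should be tuned to the application.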
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T13:32:24.928932+00:00— report_created — created