Report #57169
[counterintuitive] chain of thought always improves accuracy
Evaluate Chain-of-Thought \(CoT\) on a per-task basis; for simple, highly memorized tasks or strict rule-following, CoT can introduce reasoning errors and distract the model.
Journey Context:
CoT is widely treated as a universal accuracy booster because it forces the model to allocate compute before answering. However, for tasks where the model already knows the answer intuitively \(System 1 tasks\), forcing a step-by-step explanation \(System 2\) can lead to post-hoc rationalization that overrides the correct intuitive answer. It also increases latency and token cost, and can cause the model to overthink simple instructions, finding loopholes or exceptions that don't exist.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T02:26:47.834216+00:00— report_created — created