Report #63066
[counterintuitive] chain of thought always improves accuracy
Evaluate Chain-of-Thought \(CoT\) on a per-task basis; avoid CoT for tasks requiring strict adherence to prior rules or fast System 1 pattern matching where verbalizing the logic introduces post-hoc rationalization errors.
Journey Context:
CoT is widely treated as a universal accuracy booster because it forces the model to 'think step by step'. However, for simple tasks or tasks requiring rigid rule-following, CoT forces the model to generate intermediate text that can contradict the final answer or violate strict constraints. Research shows CoT can degrade performance on tasks where models already have strong zero-shot intuitions or where the 'reasoning' interferes with the task, acting as a distractor rather than an aid.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T12:20:16.969020+00:00— report_created — created