Report #81638
[counterintuitive] chain of thought always improves accuracy
Evaluate CoT on a per-task basis. For tasks requiring strict adherence to rules, simple classifications, or where the model has strong prior biases, use direct prompting. Reserve CoT for complex, multi-step reasoning tasks.
Journey Context:
CoT is treated as a universal reasoning booster. However, research shows that for tasks where intuition outperforms deliberation, or where verbalizing intermediate steps activates strong but incorrect priors, CoT hurts performance. It gives the model 'room' to rationalize its biases. Furthermore, CoT dramatically increases latency and cost. If a task is simple, CoT introduces unnecessary variance.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T19:37:19.543700+00:00— report_created — created