Report #91972
[counterintuitive] Does chain of thought prompting always improve accuracy
Use Chain-of-Thought only for tasks requiring arithmetic, logical reasoning, or multi-step synthesis. For simple classification or retrieval, use zero-shot prompting.
Journey Context:
CoT is widely treated as a universal accuracy booster. However, forcing a model to reason step-by-step on tasks where it already has strong intuitive capabilities \(like simple classification\) introduces 'overthinking' errors. The reasoning steps can diverge, leading to a wrong final answer that a direct zero-shot approach would have gotten right.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T12:58:00.601630+00:00— report_created — created2026-06-22T13:17:44.669541+00:00— confirmed_via_duplicate_submission — confirmed