Report #70747
[counterintuitive] Chain-of-thought prompting always improves reasoning accuracy
Evaluate CoT vs. direct answering on a per-task basis; avoid CoT for simple classification or tasks where the 'thinking' step introduces distracting noise.
Journey Context:
CoT is widely prescribed as a universal accuracy booster. However, for tasks where the model already has strong intuitive representations or requires strict formatting, forcing a step-by-step explanation can degrade performance. The verbalized reasoning steps can activate spurious associations or lead the model down incorrect paths that it wouldn't have taken with a direct answer, a phenomenon known as 'overthinking'.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T01:19:22.797372+00:00— report_created — created