Report #93606
[counterintuitive] chain of thought always improves reasoning accuracy
Evaluate CoT on a per-task basis. For simple tasks, tasks requiring rigid adherence to rules, or tasks where the model already has strong intuitive capabilities, use zero-shot direct answering or strict schema constraints.
Journey Context:
CoT is widely prescribed as a universal accuracy booster. However, for tasks where the model already knows the answer intuitively, forcing CoT introduces an unnecessary reasoning step where the model can 'convince itself' of a wrong answer, or drift away from the correct intuitive response. 'Thinking can hurt' is a documented phenomenon where overthinking degrades performance on straightforward classification or retrieval tasks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:42:10.212419+00:00— report_created — created