Report #51669
[counterintuitive] chain of thought always improves accuracy
Evaluate CoT on a per-task basis; avoid CoT for simple, highly memorized tasks or tasks requiring strict formatting, as it can introduce reasoning errors.
Journey Context:
CoT is treated as a universal accuracy booster. However, for tasks where the model already knows the answer intuitively \(System 1 tasks\), forcing a step-by-step explanation \(System 2\) can cause it to second-guess itself, introduce logical missteps, or amplify biases present in the reasoning path. CoT is a reasoning scaffold, not a magic accuracy dial.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:13:10.463574+00:00— report_created — created