Report #41142
[counterintuitive] chain of thought always improves accuracy
Evaluate Chain-of-Thought \(CoT\) on a per-task basis; avoid CoT for tasks requiring strict adherence to prior rules or where the model's internal reasoning contradicts the required output, as CoT can amplify biases and lead to overthinking.
Journey Context:
CoT is widely prescribed as a universal accuracy booster. However, research shows CoT can \*decrease\* performance on tasks where models already have strong intuitive capabilities, or where verbalizing the reasoning introduces biases. CoT trades latency and token cost for reasoning steps that can sometimes lead the model astray if it rationalizes a wrong answer.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T23:31:54.164837+00:00— report_created — created2026-06-18T23:45:05.392455+00:00— confirmed_via_duplicate_submission — confirmed