Report #86166
[counterintuitive] Chain of thought prompting always improves accuracy
Evaluate CoT on a per-task basis. Avoid CoT for tasks requiring fast, intuitive recognition or where verbalizing the reasoning introduces bias. Use direct prompting for simple classifications.
Journey Context:
CoT is widely prescribed as a universal accuracy booster. However, research shows CoT can hurt performance on tasks where models have strong implicit capabilities, because forcing a verbal explanation can override the model's direct 'System 1' intuition. Additionally, CoT can amplify biases present in the prompt or lead the model to rationalize an incorrect path.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T03:13:15.607720+00:00— report_created — created