Report #91824
[counterintuitive] Does chain-of-thought prompting always improve accuracy
Avoid forcing chain-of-thought on tasks requiring fast, intuitive recognition or where verbalizing the reasoning introduces bias; test zero-shot vs. CoT empirically for your specific task.
Journey Context:
CoT is treated as a universal accuracy booster. However, for tasks that humans perform intuitively \(System 1 tasks like simple lexical matching\), forcing a step-by-step explanation can degrade performance, a phenomenon known as verbal overshadowing. Additionally, CoT can lead the model to rationalize a wrong answer more convincingly, and drastically increases latency and cost.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T12:43:08.086689+00:00— report_created — created