Report #78970
[counterintuitive] Does chain-of-thought prompting always improve reasoning accuracy?
Use Chain-of-Thought (CoT) selectively. Avoid it for tasks that require strict adherence to an output format, tasks that need low-latency execution, and tasks that are primarily intuitive or recognition-based, where verbalizing the logic can disrupt the model's implicit pattern matching.
Journey Context:
CoT is often treated as a universal accuracy booster. However, forcing a model to explain its reasoning can lead to post-hoc rationalization: the model generates a plausible but incorrect explanation that pulls it away from the correct intuitive answer. CoT also increases latency and token usage, and it can severely degrade performance on highly structured tasks, where the model overthinks and breaks the required schema.
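The selective-use guidance above can be sketched as a small routing step that decides, per task, whether to append a step-by-step instruction to the prompt. This is a minimal illustration, not a tested recipe: `TaskProfile`, `should_use_cot`, `build_prompt`, and both prompt suffixes are hypothetical names and wording introduced here, and the three flags simply mirror the three cases named in the recommendation.

```python
from dataclasses import dataclass

# Hypothetical prompt suffixes; the exact wording is an assumption.
COT_SUFFIX = "Think through the problem step by step, then give the final answer."
DIRECT_SUFFIX = "Respond with the final answer only."


@dataclass(frozen=True)
class TaskProfile:
    """Hypothetical task description; one flag per case where CoT is discouraged."""
    strict_format: bool = False      # output must match a schema or format exactly
    low_latency: bool = False        # response time or token budget is tight
    recognition_based: bool = False  # answer relies on pattern matching, not multi-step logic


def should_use_cot(profile: TaskProfile) -> bool:
    """Return False as soon as any of the three discouraged cases applies."""
    return not (
        profile.strict_format or profile.low_latency or profile.recognition_based
    )


def build_prompt(question: str, profile: TaskProfile) -> str:
    """Append the CoT instruction only when the task profile allows it."""
    suffix = COT_SUFFIX if should_use_cot(profile) else DIRECT_SUFFIX
    return f"{question}\n\n{suffix}"


if __name__ == "__main__":
    # Multi-step arithmetic with no constraints: CoT instruction is appended.
    print(build_prompt("What is 17 * 24?", TaskProfile()))
    # Schema-bound output: the direct-answer instruction is used instead.
    print(build_prompt("Return the user record as JSON.", TaskProfile(strict_format=True)))
```

How a task gets classified into these flags is left open here; in practice that judgment is the hard part, and the flags would need to be checked against measured accuracy for the model in question.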
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T15:08:43.505140+00:00— report_created — created