Report #52259
[counterintuitive] chain of thought always improves accuracy
Apply chain-of-thought prompting selectively, for complex multi-step reasoning tasks. Avoid it for simple, highly memorized tasks and for tasks that require strict output formatting or low latency.
Journey Context:
CoT is widely adopted as a default prompting technique to improve accuracy. However, for tasks the model has already mastered, CoT forces it to generate intermediate steps it does not need. Each extra step is another chance for a logical error, and the model can 'over-think' and talk itself out of the correct answer. CoT also substantially increases latency and token usage, because every reasoning token must be generated before the answer. CoT is a tool for allocating compute, not a universal accuracy booster.
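One way to make the selective approach concrete is a small routing step in front of the prompt builder. The sketch below is illustrative only: the Task fields, the use_cot and build_prompt helpers, and both suffix strings are hypothetical names chosen for this example, not part of any library, and the decision of whether a task needs multi-step reasoning is assumed to come from the caller.

```python
from dataclasses import dataclass

# Hypothetical prompt suffixes; tune the wording for your own model.
COT_SUFFIX = "Think through the problem step by step, then give the final answer."
DIRECT_SUFFIX = "Answer with the final result only."


@dataclass
class Task:
    question: str
    needs_multistep_reasoning: bool      # caller's judgment (assumption)
    latency_sensitive: bool = False      # e.g. interactive autocomplete
    strict_output_format: bool = False   # e.g. output parsed by a machine


def use_cot(task: Task) -> bool:
    """Return True only when CoT is likely worth its extra tokens."""
    # Latency and strict formatting constraints rule CoT out first.
    if task.latency_sensitive or task.strict_output_format:
        return False
    # Otherwise spend the extra compute only on multi-step reasoning.
    return task.needs_multistep_reasoning


def build_prompt(task: Task) -> str:
    """Append the CoT or direct-answer instruction to the question."""
    suffix = COT_SUFFIX if use_cot(task) else DIRECT_SUFFIX
    return f"{task.question}\n\n{suffix}"
```

For example, build_prompt(Task("What is the capital of France?", needs_multistep_reasoning=False)) yields a direct-answer prompt, while a word problem flagged as multi-step gets the step-by-step suffix. Whether either choice actually helps should be checked against your own evaluation set.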
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T18:12:34.277881+00:00 — report_created — created