Report #46669
[counterintuitive] Chain-of-thought prompting always improves reasoning accuracy
Apply Chain-of-thought only for tasks requiring multi-step reasoning or calculation. Use zero-shot or direct prompting for simple classification or retrieval tasks.
Journey Context:
CoT is often treated as a universal accuracy booster. However, forcing an LLM to verbalize reasoning steps on tasks it can already do intuitively \(like simple sentiment analysis or known fact retrieval\) introduces unnecessary tokens, increasing latency and cost. Worse, it can actually degrade accuracy by causing the model to 'overthink' and second-guess correct intuitive responses, or by exposing flawed intermediate logic that leads to an incorrect final answer.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:48:28.039209+00:00— report_created — created