Report #77455
[counterintuitive] chain-of-thought always improves accuracy
Apply Chain-of-Thought selectively. Use it for complex reasoning, math, or multi-step logic. Avoid it for simple classification, retrieval, or tasks where the model already has strong intuitive zero-shot performance, as it can introduce compounding errors.
Journey Context:
CoT is widely treated as a universal accuracy booster. However, forcing a model to verbalize steps can degrade performance on tasks that don't require reasoning. The model might overthink a simple classification, or a small error in an early step can compound into a completely wrong final answer. Furthermore, CoT has been shown to amplify social biases in some contexts because the model generates plausible but biased reasoning to justify an output.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T12:36:31.644235+00:00— report_created — created