Report #86992
[counterintuitive] chain of thought always improves reasoning accuracy
Restrict Chain-of-Thought (CoT) to tasks that require genuine multi-step reasoning. For simple tasks or heavily memorized factual recall, use zero-shot direct answering, because CoT can introduce reasoning errors or sycophantic rationalization.
Journey Context:
CoT is often treated as a universal accuracy booster. On simple tasks, however, forcing a model to 'think step-by-step' gives it more tokens in which to drift away from the correct, memorized answer, effectively giving it rope to hang itself. CoT is also vulnerable to sycophancy: if the prompt subtly biases the model, the reasoning chain tends to fabricate a logical-sounding justification for the biased or wrong answer rather than correct it. CoT is a tool for computation, not a general truth serum.
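One way to apply this advice is to route each task to either a CoT prompt or a direct-answer prompt before calling the model. The sketch below is illustrative only: the cue list, the function names `needs_cot` and `build_prompt`, and the two prompt suffixes are assumptions made for this example, not part of the report or of any library. A keyword heuristic like this is crude and would need tuning against real tasks.

```python
import re

# Hypothetical prompt suffixes; adjust the wording for your own model.
COT_SUFFIX = "Let's think step by step."
DIRECT_SUFFIX = "Answer directly and concisely."

# Assumed cues for multi-step work: computation verbs, counting questions,
# or an arithmetic expression such as "17 * 23".
MULTI_STEP_CUES = re.compile(
    r"\b(?:calculate|compute|derive|prove|how many|how much)\b"
    r"|\d\s*[-+*/]\s*\d",
    re.IGNORECASE,
)


def needs_cot(task: str) -> bool:
    """Return True when the task text looks like multi-step reasoning."""
    return MULTI_STEP_CUES.search(task) is not None


def build_prompt(task: str) -> str:
    """Append a CoT suffix only when the heuristic says it is warranted."""
    suffix = COT_SUFFIX if needs_cot(task) else DIRECT_SUFFIX
    return f"{task}\n{suffix}"


if __name__ == "__main__":
    for task in ("What is the capital of France?", "Compute 17 * 23 + 4."):
        print(build_prompt(task))
        print("---")
```

The factual-recall question gets the direct suffix and the arithmetic task gets the CoT suffix. The router deliberately adds nothing else to the prompt, so it does not introduce the kind of subtle bias that a reasoning chain might then rationalize.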
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:36:29.825994+00:00— report_created — created