Report #40415
[counterintuitive] chain of thought always improves reasoning
Evaluate Chain-of-Thought \(CoT\) on a per-task basis; for simple, highly memorized tasks or tasks requiring strict adherence to rigid rules, use zero-shot direct answering or strict schema constraints, as CoT can introduce confabulations that derail the final answer.
Journey Context:
CoT is widely treated as a universal accuracy booster that forces the model to 'think'. However, for tasks where the model already knows the answer intuitively or requires strict rule-following, forcing CoT creates a longer attack surface for the model to make a logical error or confabulate a premise that leads to the wrong conclusion. Research shows CoT degrades performance on tasks where direct retrieval or simple pattern matching suffices.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T22:18:35.833453+00:00— report_created — created