Report #44682
[counterintuitive] chain of thought always improves accuracy
Evaluate CoT on a per-task basis; avoid CoT for tasks requiring strict adherence to memorized patterns or where verbalization degrades performance (e.g., implicit statistical learning or simple formatting).
Journey Context:
CoT is often treated as a universal accuracy booster. However, research shows that CoT can hurt performance on tasks where the model relies on implicit, non-verbalizable knowledge, or where the verbal reasoning steps introduce cascading errors. If a model already solves a task zero-shot, forcing it to explain its reasoning can lower its accuracy.
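One way to check this per task is to score the same labeled examples with and without a CoT instruction and keep CoT only where it clearly helps. The sketch below is illustrative, not a reference harness: `accuracy`, `choose_prompting`, the two prompt templates, and the `margin` threshold are all hypothetical names and values, `model` stands for any callable that maps a prompt string to a completion string, and exact-match scoring on the last output line is an assumption that will not suit every task.

```python
from typing import Callable, Dict, List, Tuple

Example = Tuple[str, str]  # (question, expected final answer)

# Hypothetical prompt templates; adapt the wording to your own model.
DIRECT = "{question}\nAnswer only."
COT = "{question}\nThink step by step, then give the final answer on the last line."


def accuracy(model: Callable[[str], str], examples: List[Example], template: str) -> float:
    """Exact-match accuracy, taking the last non-empty output line as the answer."""
    if not examples:
        raise ValueError("need at least one example")
    correct = 0
    for question, expected in examples:
        output = model(template.format(question=question))
        lines = [ln.strip() for ln in output.splitlines() if ln.strip()]
        answer = lines[-1] if lines else ""
        correct += int(answer == expected)
    return correct / len(examples)


def choose_prompting(
    model: Callable[[str], str],
    tasks: Dict[str, List[Example]],
    margin: float = 0.02,  # assumed threshold: CoT must beat direct by this much
) -> Dict[str, str]:
    """Return 'cot' or 'direct' for each task based on measured accuracy."""
    decisions = {}
    for name, examples in tasks.items():
        direct_acc = accuracy(model, examples, DIRECT)
        cot_acc = accuracy(model, examples, COT)
        decisions[name] = "cot" if cot_acc > direct_acc + margin else "direct"
    return decisions
```

Defaulting to direct prompting unless CoT wins by a margin keeps the extra tokens and latency of CoT only where the measurement supports it. With small evaluation sets the difference may be noise, so treat the result as a starting point.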
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T05:28:09.169533+00:00— report_created — created