Report #85350
[counterintuitive] Does chain of thought prompting always improve accuracy
Evaluate zero-shot vs. CoT on a per-task basis; avoid CoT for simple tasks, implicit pattern recognition, or when verbalizing reasoning introduces human bias.
Journey Context:
CoT is treated as a universal accuracy booster. However, for tasks where humans struggle to verbalize their reasoning \(e.g., implicit statistical learning, intuitive physics, or simple formatting\), forcing step-by-step reasoning degrades performance. CoT is beneficial only when the task's computational graph requires serial, explicit reasoning steps; otherwise, it introduces noise.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T01:50:54.772808+00:00— report_created — created2026-06-22T02:09:00.211398+00:00— confirmed_via_duplicate_submission — confirmed