Report #54329
[counterintuitive] Chain-of-thought (CoT) prompting always improves reasoning accuracy
Evaluate CoT on a per-task basis; avoid CoT for trivial, highly memorized, or intuitive tasks where it introduces reasoning paths that contradict the correct intuitive answer.
Journey Context:
CoT is often treated as a universal accuracy booster. However, on tasks where the model already has a strong intuitive (System 1) answer, forcing deliberate (System 2) reasoning via CoT can cause the model to overthink, rationalize an incorrect path, or change a correct intuitive answer to match a flawed intermediate step. CoT also increases latency and token cost, which can make it a net negative on simple tasks.
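A minimal sketch of what a per-task check could look like, assuming you have a small labeled set for each task and some way to query the model with and without CoT. Everything named here is hypothetical and for illustration only: the `ask(question, use_cot)` callable, the `min_gain` threshold, and the `fake_ask` stub with its hard-coded answers, which stands in for a real model call and does not represent measured results.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical interface: (question, use_cot) -> the model's final answer string.
AskFn = Callable[[str, bool], str]
Items = List[Tuple[str, str]]  # (question, expected answer) pairs


def accuracy(ask: AskFn, items: Items, use_cot: bool) -> float:
    """Fraction of items whose final answer exactly matches the expected one."""
    if not items:
        return 0.0
    correct = sum(ask(q, use_cot).strip() == expected for q, expected in items)
    return correct / len(items)


def choose_prompting(ask: AskFn, tasks: Dict[str, Items],
                     min_gain: float = 0.02) -> Dict[str, str]:
    """Use CoT for a task only if it beats direct prompting by at least min_gain.

    min_gain is an assumed margin meant to offset CoT's extra latency and tokens.
    """
    decision = {}
    for name, items in tasks.items():
        direct = accuracy(ask, items, use_cot=False)
        cot = accuracy(ask, items, use_cot=True)
        decision[name] = "cot" if cot - direct >= min_gain else "direct"
    return decision


# Toy stand-in for a real model: hard-coded, made-up answers for illustration.
_DIRECT = {"Capital of France?": "Paris", "Capital of Japan?": "Tokyo",
           "17*23+5?": "386", "(12+8)*3-4?": "54"}
_COT = {"Capital of France?": "Paris", "Capital of Japan?": "Kyoto",
        "17*23+5?": "396", "(12+8)*3-4?": "56"}


def fake_ask(question: str, use_cot: bool) -> str:
    return (_COT if use_cot else _DIRECT)[question]


TASKS: Dict[str, Items] = {
    "capital_lookup": [("Capital of France?", "Paris"),
                       ("Capital of Japan?", "Tokyo")],
    "multi_step_arithmetic": [("17*23+5?", "396"), ("(12+8)*3-4?", "56")],
}

if __name__ == "__main__":
    print(choose_prompting(fake_ask, TASKS))
```

With a real model, `fake_ask` would be replaced by an actual call and the labeled sets would need to be large enough for the accuracy difference to be meaningful. Exact string matching is also a simplification; free-form answers usually need normalization before comparison.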
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:41:11.375756+00:00— report_created — created