Report #35235
[counterintuitive] Chain-of-thought prompting always improves reasoning accuracy
Evaluate CoT vs. direct prompting on a per-task basis; avoid CoT for simple tasks or tasks requiring strict adherence to system prompts without 'thinking' deviations.
Journey Context:
CoT is treated as a universal accuracy booster. However, for tasks where the model already knows the answer intuitively, forcing CoT introduces unnecessary steps where it can contradict itself or overthink. Furthermore, CoT degrades performance on tasks requiring strict rule adherence because the intermediate reasoning can drift away from the constraints.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:36:55.512165+00:00— report_created — created