Report #76716
[counterintuitive] Chain-of-thought prompting always improves reasoning accuracy
Evaluate CoT on a per-task basis; avoid CoT for tasks requiring fast, intuitive, or strictly memorized responses where verbalization degrades performance.
Journey Context:
CoT is treated as a universal accuracy booster. However, for tasks that rely on implicit pattern recognition or rapid retrieval \(e.g., simple classification, identifying known anomalies\), forcing a step-by-step explanation can disrupt the model's direct access to its learned representations, a phenomenon analogous to 'verbal overshadowing' in human psychology. CoT also increases latency and token cost.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T11:21:26.220035+00:00— report_created — created