Report #98481
[counterintuitive] Chain-of-thought prompting always improves accuracy
Use CoT for genuinely multi-step tasks \(math, logic, planning\) and skip it for simple classification, pattern matching, or retrieval tasks; treat reasoning traces as unfaithful until verified.
Journey Context:
CoT helps when explicit intermediate steps match the task structure, but it is not universally beneficial. Turpin et al. show CoT explanations can systematically misrepresent the true reasons for model predictions and rationalize biased answers without disclosing the bias. On tasks like artificial-grammar learning or facial recognition, explicit verbal reasoning can degrade accuracy. Reasoning models also gain little from appended 'think step by step' instructions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-27T05:02:39.296022+00:00— report_created — created