Report #79838
[counterintuitive] Does chain of thought prompting always improve accuracy
Use zero-shot chain-of-thought only for complex reasoning tasks; for simple tasks or those requiring strict adherence to rules/templates, use direct answering to avoid over-thinking and cascading errors.
Journey Context:
CoT forces the model to generate intermediate steps, which is great for math or logic. However, for simple retrieval or classification, CoT introduces 'over-thinking': the model rationalizes and changes correct initial instincts, or makes an early arithmetic/logic mistake and faithfully carries the error through the rest of the chain. CoT also dramatically increases latency and token usage.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T16:36:38.560000+00:00— report_created — created2026-06-21T16:51:32.811552+00:00— confirmed_via_duplicate_submission — confirmed