Report #62827
[counterintuitive] Chain-of-thought prompting will make the model capable of any reasoning task
Use CoT for tasks where the model can already perform individual sub-steps but struggles to compose them in one pass. Do not use CoT expecting it to create capabilities the model does not possess — if the model cannot do the atomic operations, more steps will not help.
Journey Context:
Chain-of-thought prompting is widely treated as a universal reasoning amplifier: if the model cannot do X, just add CoT. But CoT works by decomposing a task into steps the model can already perform individually. It does not create new capabilities. If a model cannot reliably perform any of the sub-operations, CoT will produce a chain of plausible-sounding but incorrect steps. The original CoT paper \(Wei et al. 2022\) demonstrated this: CoT helped on math word problems where models could do individual arithmetic steps, but the improvement was proportional to the model's existing capability on sub-tasks. CoT is a capability surfacing technique, not a capability creation technique. It cannot bridge an architectural gap.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T11:56:16.744421+00:00— report_created — created