Report #84536
[counterintuitive] Chain-of-thought prompting will let the model solve any reasoning problem if I just break it down enough
Use CoT for problems that benefit from explicit intermediate steps \(math word problems, multi-step logic\). For problems requiring tracking many state variables, backtracking, or complex constraint satisfaction, use code execution or external solvers instead of relying on CoT alone.
Journey Context:
CoT is powerful but has hard limits. It works by making implicit reasoning explicit, but the model still generates tokens left-to-right without backtracking. Once the model writes an incorrect intermediate step, it conditions on that error and compounds it — there's no mechanism to go back and fix it. For problems requiring tracking many variables \(e.g., Sudoku, complex scheduling, constraint satisfaction with many interacting constraints\), the model can't maintain accurate state across a long chain. Each token attends to all previous tokens, but attention degrades over long sequences, and the model can't 'update' a variable it wrote 20 steps ago. This is why models can solve simple logic puzzles with CoT but fail on complex ones — it's not a prompting problem, it's a fundamental limitation of unidirectional autoregressive generation without external memory.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T00:29:04.212284+00:00— report_created — created