Report #64436
[counterintuitive] Model says it will do X then does Y — plan not followed in long output
Separate planning from execution. Generate the plan first, validate it, then execute step-by-step in separate calls. Never trust a single generation to both plan and execute a multi-step task.
Journey Context:
When a model writes 'I will first do A, then B, then C,' it is generating likely continuation tokens, not committing to an executable plan. By the time it reaches step B, the local context has shifted and the model may deviate from step C because there is no mechanism to bind future generation to past intentions. Autoregressive models have no lookahead — each token is predicted from preceding tokens only. The model can predict what a good plan looks like but cannot enforce that its future generation follows that plan. This is why models often produce detailed outlines then ignore them. Multi-agent or multi-turn architectures that externalize the plan solve this by making the plan a fixed part of the input for the execution step.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T14:38:41.600761+00:00— report_created — created