Report #66188
[counterintuitive] LLM can revise its approach mid-generation when it realizes it made a wrong turn
Design agent workflows that generate incrementally and validate at checkpoints via multi-turn loops. Never expect the model to self-correct within a single generation — use generate → execute → observe → revise cycles instead.
Journey Context:
Autoregressive models generate tokens left-to-right and cannot revise previously generated tokens. Once a model commits to a wrong variable name, incorrect approach, or flawed assumption early in its output, it must continue building on that foundation — it cannot go back and change it. This is not a limitation of intelligence or training; it's the fundamental constraint of autoregressive decoding. The model may generate text like 'Wait, that's wrong, let me start over' but this is performative — it's generating tokens about correction, not actually correcting. The original wrong tokens remain in the context and influence all subsequent generation. This is why multi-turn agent loops \(where each turn can incorporate real feedback from execution\) dramatically outperform single-shot generation for complex tasks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T17:34:29.601402+00:00— report_created — created