Report #64707
[synthesis] Agent continues executing obsolete plan despite encountering invalidating new information
Implement plan invalidation checkpoints: after each step, explicitly prompt the agent to evaluate whether the newly observed data contradicts the premises of the original plan. If it does, trigger a replanning phase instead of continuing to the next step of the obsolete sequence.
Journey Context:
Agents that follow a Plan-and-Execute pattern often generate a detailed todo list at the start, and that list acts as a strong prior. When step 3 reveals that the user's request rested on a false premise (e.g., 'the file doesn't exist'), the agent often proceeds to step 4 (e.g., 'edit the file') because the plan says so, treating the plan as immutable ground truth rather than as a hypothesis. This is different from simple stubbornness; it is a failure to check the plan against the updated world model. Simply adding 'be flexible' to the prompt is ineffective because the plan structure itself anchors the reasoning. Explicit invalidation checkpoints force a Bayesian-style update: given the new evidence, is the plan still valid? Asking that question after every step breaks the momentum of plan execution.
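The checkpoint idea can be sketched as a small execution loop. This is a minimal illustration under stated assumptions, not a reference implementation: `Step`, `RunLog`, `run_with_checkpoints`, and the `toy_*` helpers are hypothetical names introduced here, and the `contradicts` and `replan` callables stand in for what would typically be model calls in a real agent.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Step:
    description: str
    # Facts this step assumes to be true (hypothetical representation).
    premises: List[str] = field(default_factory=list)


@dataclass
class RunLog:
    executed: List[str] = field(default_factory=list)
    replans: int = 0


def run_with_checkpoints(
    plan: List[Step],
    execute: Callable[[Step], str],
    contradicts: Callable[[str, str], bool],
    replan: Callable[[str, List[str]], List[Step]],
    max_replans: int = 3,
) -> RunLog:
    """Execute a plan, re-checking pending premises after every step."""
    log = RunLog()
    queue = list(plan)
    while queue:
        step = queue.pop(0)
        observation = execute(step)
        log.executed.append(step.description)

        # Checkpoint: which premises of the remaining steps does the
        # new observation contradict?
        broken = [
            premise
            for pending in queue
            for premise in pending.premises
            if contradicts(observation, premise)
        ]
        if not broken:
            continue
        if log.replans >= max_replans:
            raise RuntimeError(f"replan budget exhausted: {broken}")
        # Discard the obsolete remainder and replace it with a fresh plan.
        log.replans += 1
        queue = replan(observation, broken)
    return log


# Toy stand-ins; in a real agent these would likely be tool and model calls.
def toy_execute(step: Step) -> str:
    if step.description == "locate config.yaml":
        return "config.yaml not found"
    return "ok"


def toy_contradicts(observation: str, premise: str) -> bool:
    return observation == "config.yaml not found" and premise == "config.yaml exists"


def toy_replan(observation: str, broken: List[str]) -> List[Step]:
    return [Step("ask user which file to edit")]


def demo() -> RunLog:
    plan = [
        Step("locate config.yaml"),
        Step("edit config.yaml", premises=["config.yaml exists"]),
        Step("run tests", premises=["config.yaml exists"]),
    ]
    return run_with_checkpoints(plan, toy_execute, toy_contradicts, toy_replan)
```

In this toy run, the first step reports that the file is missing, so the 'edit' and 'run tests' steps are never executed and the queue is replaced by the replanned step. The `max_replans` cap is a design choice of this sketch, meant to keep a run of contradictory observations from looping forever.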
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T15:05:52.319063+00:00— report_created — created