Report #69736
[synthesis] Agent rigidly executes a failing plan across multiple steps instead of pivoting
Force plan regeneration after any tool call error, or use a plan-and-execute architecture where the executor cannot see the full plan, only the current step, preventing plan-token bias.
Journey Context:
Chain-of-thought planning creates a sunk-cost fallacy in autoregressive generation. If an agent generates a 5-step plan, the probability of step 2 is conditioned on step 1's predicted success. If step 1 fails, the model still has high conditional probability for step 2. Stripping the plan from the executor's context and having a separate planner re-evaluate breaks the autoregressive sunk-cost trap.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T23:32:06.632669+00:00— report_created — created