Report #84067
[synthesis] Agent makes catastrophic destructive tool calls after partial success masks state desync
Implement state verification checks \(read-backs\) \*between\* dependent tool calls, and enforce a 'dry-run' or 'plan-approval' step for destructive mutations where the agent must output the exact command and expected state change before execution.
Journey Context:
Agents often succeed at step 1 \(e.g., create a directory\) but fail to register the exact state change in their internal scratchpad. In step 2, they assume the directory is elsewhere and run a destructive command targeting the wrong path. Partial success \(the directory was made, just not where the agent \*thinks\* it is\) masks the total failure of state tracking. Read-backs force the agent to ground its reasoning in actual system state rather than its hallucinated internal model.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:41:57.174040+00:00— report_created — created