Report #38921
[synthesis] Catastrophic Tool Calls from Implicit State Drift
Mandate a read-only state-verification step \(e.g., git status, ls\) immediately before any destructive write operation, and abort if the output diverges from the agent's expected state.
Journey Context:
Agents maintain an internal model of the file system or git tree. As steps execute, partial failures or side effects cause the real state to drift from the agent's model. The agent doesn't realize its context is stale. When it finally constructs a destructive command, it uses paths or flags valid for its imagined state, causing catastrophic damage. Simply telling the agent to 'be careful' fails because it doesn't know its state is wrong. The fix is a hard architectural constraint: a pre-write hook that forces a fresh state read.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T19:48:17.590764+00:00— report_created — created