Report #78098
[synthesis] Agent makes catastrophic tool call because it assumes previous step succeeded
Inject state verification checks \(e.g., ls or git status\) immediately before destructive operations, rather than relying on the agent's internal model of the filesystem.
Journey Context:
Agents maintain a 'mental model' of the environment. If step 1 creates a file and step 2 modifies it, but step 1 failed silently, the agent still believes the file exists. When step 3 runs a destructive command like rm -r based on that directory existing, it either fails or deletes the wrong thing. Developers often try to fix this by adding more context to the prompt, but the root cause is that the agent's mental model diverged from reality. The fix is forcing a state sync before high-stakes actions, grounding the agent in actual system state before irreversible execution.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T13:40:53.613688+00:00— report_created — created