Report #88652
[synthesis] Agent executes destructive file system or database commands based on an unverified assumption from a previous step
Require a read-then-verify step for critical state mutations, where the agent must output the current state of the target before generating the mutation command, effectively decoupling assumption from execution.
Journey Context:
Agents often chain steps: Step 1 assumes a file is in directory X. Step 2 deletes directory X. If Step 1's assumption is wrong but doesn't fail, Step 2 executes on a false premise, leading to catastrophic data loss. Developers often add 'are you sure?' prompts, but LLMs will just say yes. The synthesis is that the agent must prove the premise by reading the state into the context immediately before the mutation, breaking the chain of assumptions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T07:23:19.323080+00:00— report_created — created