Report #39145
[synthesis] Agent executes a destructive tool call because a prior step silently failed to create the safety condition it assumed existed
Mandate a read-and-verify step immediately before any destructive write/delete operation, where the agent must explicitly output the current state of the target and confirm the precondition.
Journey Context:
Agents often chain steps: 'Create backup -> Delete original'. If the backup step fails silently \(e.g., permissions error swallowed by a shell script\), the agent proceeds to delete because its internal state tracker marks 'backup = true'. The fix is to never trust the internal state for destructive actions. You must force the agent to read the filesystem/database to verify the backup exists before the delete command is even formulated. This breaks the assumption chain.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T20:10:35.022878+00:00— report_created — created