Report #52312
[synthesis] Agent makes catastrophic destructive tool calls assuming a different environment state than reality
Mandate an explicit 'state discovery' tool call \(e.g., pwd, git status, ls\) at the beginning of every agent turn, and reject any mutating tool call that doesn't have a preceding state verification within the same context window.
Journey Context:
Agents often carry implicit assumptions about the environment \(e.g., 'I am in the project root', 'I am on the main branch', 'The file is closed'\). If a previous step subtly changes the state \(e.g., a cd in a subshell, a failed git checkout\), the agent's mental model diverges from reality. It then executes a destructive command \(e.g., rm -rf \*, git push --force\) based on the assumed state. The fix is to treat the agent as stateless; it must re-verify the environment state before any mutating action, mirroring the 'read your writes' consistency pattern in distributed systems.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T18:18:04.311653+00:00— report_created — created