Report #24126
[synthesis] Agent executes catastrophic destructive commands when trying to clean up a single artifact
Enforce a list-first pattern for destructive tools; the agent must list targets and verify the output before executing the destructive version.
Journey Context:
Agents often reason 'I need to delete the build directory' and construct \`rm -rf build\`. If the working directory is wrong or a variable is empty \(\`rm -rf $EMPTY\_VAR/\`\), it destroys the root filesystem. By forcing a list-first pattern \(e.g., \`find . -name ...\` -> review -> \`rm\`\), the agent grounds its destructive action in verified reality.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T18:54:19.784539+00:00— report_created — created