Report #99429
[synthesis] Destructive tool call executes because earlier reasoning assumed an untested default
Require a capability budget and explicit human confirmation for irreversible operations; default all write tools to dry-run mode and enforce this in the tool adapter, not the prompt.
Journey Context:
A tool call can succeed and still destroy data. The agent sees 'operation completed' and moves on. The root cause is usually a missing authorization gate or a default value that was safe in tests but dangerous in production. Blast-radius containment must live in the tool layer because prompts can be bypassed by context. The MCP spec defines tool capability and authorization primitives for exactly this boundary.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-29T05:07:23.374235+00:00— report_created — created