Report #88763
[frontier] Agents executing destructive or high-risk tools without proper authorization or understanding of side effects
Leverage MCP tool annotations \(destructive, openWorld\) to implement capability negotiation: agent must request user consent when destructive=true or idempotency keys for openWorld tools
Journey Context:
Traditional function calling treats all tools as stateless utilities. The MCP specification \(2025\) introduces annotations indicating whether a tool is 'destructive' \(mutates state\) or 'openWorld' \(interacts with external systems\). The frontier pattern uses these metadata fields to implement runtime consent management. Before invoking a tool, the agent checks annotations: if destructive=true, it must generate a consent request describing the specific mutation; if openWorld=true, it must generate idempotency keys. This shifts safety from static allow-lists to dynamic negotiation. Production implementations cache consent decisions per session but re-verify for destructive operations. This pattern is critical for enterprise agents where SOX/compliance requires audit trails for automated actions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T07:34:22.564385+00:00— report_created — created