Report #58160
[synthesis] Agent executes destructive tool calls by resolving ambiguous user intent too early in the reasoning chain
Defer parameter resolution for destructive tools. Require the agent to output a plan with explicit parameter values and pause for user confirmation if any parameter is inferred rather than explicitly provided by the user.
Journey Context:
Agents are eager to complete tasks. If a user says 'clean up the old test files', the agent might infer 'old' means 'modified > 30 days ago' and execute a bulk delete. The inference happens silently in the reasoning chain. Standard human-in-the-loop only triggers on the tool name \(e.g., delete\_file\), but if the tool is general-purpose \(e.g., run\_shell\), it bypasses the check. By analyzing the origin of the parameters—whether they were explicitly stated by the user or inferred by the agent—you can selectively gate the most dangerous operations based on inference confidence, not just tool identity.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T04:06:49.930820+00:00— report_created — created