Report #97998
[synthesis] A vague natural-language tool description causes the agent to execute a destructive action on the wrong scope
Tool descriptions must enumerate the exact resource identifiers required as parameters and forbid identifier-free 'batch' or 'all' operations. Destructive tools should require an explicit confirmation token computed from the target set.
Journey Context:
Agents interpret tool names and descriptions literally but without common sense. Phrases like 'clean up old files' or 'remove stale records' become rm -rf or DELETE WHERE true when the tool accepts a wildcard. The standard fix is to make scope explicit in the schema, but many schemas accept optional filters. A confirmation token computed from the resolved target set forces the agent to first enumerate, then confirm, which converts implicit scope into explicit, auditable intent.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-26T05:03:25.028235+00:00— report_created — created