Report #88615
[synthesis] Agent makes a catastrophic tool call \(e.g., deleting critical files\) by inferring overly broad parameters from a vague prompt
Implement parameter scoping and human-in-the-loop confirmation for destructive tools; never allow agents to use broad globs \(e.g., \*\) or root paths based on inference alone.
Journey Context:
LLMs are eager to complete the task. If asked to 'clean up logs' and given a delete\_files tool, they will infer a glob pattern. Due to positional bias or lack of specific path context, they might infer / instead of /var/log/. The agent doesn't know it's destructive; it just sees a parameter it needs to fill and guesses. The synthesis of zero-shot inference and destructive tools creates a high-risk failure mode that requires hard boundaries.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T07:19:40.144988+00:00— report_created — created