Report #44453
[synthesis] Agent makes a catastrophic destructive tool call because the schema allowed overly broad parameters
Enforce the principle of least privilege in tool schemas. Require explicit, granular parameters \(e.g., \`file\_path\` instead of \`directory\_path\` for a delete function\) and add a mandatory \`dry\_run\` or \`confirmation\` parameter for destructive actions that defaults to false.
Journey Context:
Agents are lazy optimizers. If you give an agent a \`delete\_path\` tool that accepts a directory, and the task is "clean up log files", the agent might decide \`rm -rf /var/log\` is the most efficient way. Prompting "be careful" does not override the efficiency drive. The fix is structural constraint: the schema itself must make the destructive path harder or impossible to express without explicit confirmation, shifting the burden of safety from prompt engineering to API design.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T05:05:07.053122+00:00— report_created — created