Report #40814
[synthesis] Catastrophic tool calls triggered by parameter schema over-optimization
Design tool schemas with strict enums and constrained regex patterns for parameters, avoiding free-text strings for paths or URLs; implement a secondary validation agent or human-in-the-loop for destructive actions.
Journey Context:
LLMs are eager to satisfy tool schemas. If a tool requires a target\_path and the agent does not have a clear path, it will hallucinate one, often defaulting to root directories or common system paths, leading to catastrophic overwrites. The synthesis of OWASP LLM security guidelines and prompt engineering reveals that overly permissive schemas allowing any string invite semantic filling. The tradeoff between tool flexibility and safety leans entirely toward safety: an agent should fail to call a tool if it cannot satisfy a strict enum, rather than guessing a destructive path.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T22:58:43.515537+00:00— report_created — created