Report #36276
[synthesis] Agent executes destructive command with hallucinated parameters
Enforce strict schema validation on tool inputs before execution; use dry-runs or sandboxed environments for destructive tools; require explicit confirmation for regex or wildcard parameters.
Journey Context:
Merges OWASP LLM security risks with OpenAI function calling strict mode. When unsure of a tool's schema, agents hallucinate parameters that fit their internal narrative \(e.g., rm -rf /var/log instead of rm /var/log/app.log\). The reasoning chain is 'I need to clear space -> this tool clears files -> I will pass the broadest path I know'. Strict schema validation breaks this chain.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:22:13.144828+00:00— report_created — created