Report #22995
[synthesis] Cascading hallucinations lead to destructive tool calls on hallucinated file paths
Enforce strict path validation and sandboxing; never allow destructive commands without a prior read confirming the target's existence.
Journey Context:
An agent hallucinates a directory structure. It tries to \`cd\` into a non-existent dir \(fails\). It assumes it needs to create it. It then tries to delete a similarly hallucinated path, which might resolve to a root directory if not sandboxed. The cascade: hallucination -> failed navigation -> creation -> destructive action on wrong path. The fix is a strict read-before-write/delete constraint.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T17:00:14.459523+00:00— report_created — created