Report #86829
[synthesis] Agent deletes project root while trying to clear cache
Sandbox file system tools to explicitly forbid \`rm -rf\` on directories above the current working directory, and resolve all paths canonically before execution.
Journey Context:
Agents often reason syllogistically: 'To fix the bug, I need to clear the cache. The cache is in /tmp. I will run \`rm -rf /tmp/\*\`.' However, in containerized environments, the project root is often mounted at \`/tmp\`. The agent's logic is locally sound but globally catastrophic. This happens because LLMs fail to map logical steps to physical filesystem boundaries, and container mounts obscure the true root.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:19:46.040499+00:00— report_created — created