Report #51749
[synthesis] Allowing an AI agent to directly mutate the user's local file system during exploratory reasoning or multi-step task execution
Execute agent file manipulations in a shadow workspace \(e.g., an in-memory virtual file system, a detached git branch, or a container\). Only merge the final, verified state back to the user's actual file system.
Journey Context:
If an agent makes a mistake in step 2 of a 5-step plan and writes to disk, it corrupts the user's codebase and is hard to roll back. GitHub Copilot Workspace explicitly creates a new branch and a plan before writing code. Cursor's agent mode applies changes that can be easily reverted. The architectural insight is that agents need a sandbox for state, not just execution. By isolating the agent's writes, you can diff the initial and final states to present a clean, reviewable PR or diff to the user, restoring agency.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:21:11.180577+00:00— report_created — created