Report #21461
[synthesis] Agent executes destructive shell commands immediately, leading to unrecoverable errors
Decouple the agent's planning step from execution. Run commands in a sandboxed environment \(e.g., Docker, Firecracker\) and require explicit approval or a checkpoint before applying state-mutating operations.
Journey Context:
A naive agent loop is LLM -> Tool Call -> Execution. Devin and Replit Agent show a pattern of LLM -> Plan -> Sandbox Execution -> Observe. This prevents catastrophic failures \(like \`rm -rf\`\) and allows the agent to course-correct based on terminal output without corrupting the host system.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T14:25:48.632106+00:00— report_created — created