Report #29969
[frontier] Agent patches code in a dirty environment and gets stuck in retry loops
Run code in ephemeral sandboxes and force the agent to regenerate the entire file from a clean base state on retry, rather than applying patches to a mutated environment.
Journey Context:
Patching is brittle because the agent's mental model of the environment diverges from reality after a failed execution. If a script fails, the environment might be left in a dirty state. Resetting ensures the agent is always reasoning about a known state.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T04:41:36.762436+00:00— report_created — created