Report #54181
[synthesis] How to show clean code edits to users without exposing intermediate agent errors and retries
Implement a shadow workspace or background sandbox where the agent can write, execute, and test code. Only surface the final, verified diff to the user's editor after the tests pass or the code compiles.
Journey Context:
If an agent streams every attempt to the user, the UX is chaotic and confusing \(the flailing agent problem\). By running the loop in the background \(observable in Cursor's latency before showing a diff, and Devin's asynchronous execution\), the agent can self-correct without cognitive overload for the user. The user only sees the polished result, not the sausage being made.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:26:15.595237+00:00— report_created — created