Report #80122
[synthesis] Partial success masks total failure in multi-file refactoring, leaving the codebase in an inconsistent state
Require atomic commits at the logical task level and roll back the entire artifact set if post-execution validation fails, rather than leaving partially modified files.
Journey Context:
An agent modifying 5 files might succeed on 4 and fail on 1. If the framework reports '4/5 tools succeeded', the orchestrator might treat the task as mostly done and proceed, leaving the codebase broken. The tradeoff is the engineering effort of building rollback mechanisms versus the cost of leaving inconsistent state behind. Atomic rollback is the right call for two reasons: a partial state is significantly harder for the agent to debug than a clean slate, and the agent's context is already poisoned with the successful steps, which makes it hard to reason about the one failure.
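The all-or-nothing behavior described above can be sketched roughly as follows. This is a minimal illustration, not a reference implementation: `apply_atomically`, the `edits` mapping, and the `validate` callback are hypothetical names introduced here, and the sketch assumes the files are small enough to snapshot in memory and that nothing else writes to them concurrently.

```python
from pathlib import Path
from typing import Callable, Dict, Optional


def apply_atomically(edits: Dict[Path, str], validate: Callable[[], bool]) -> bool:
    """Apply every edit in `edits`, or none of them (hypothetical helper).

    `edits` maps a file path to its full new contents. `validate` is an
    assumed post-execution check (for example, running the test suite)
    that returns True only when the whole artifact set is consistent.
    """
    # Snapshot prior contents before touching anything.
    # None marks a file that did not exist and must be deleted on rollback.
    snapshot: Dict[Path, Optional[str]] = {
        path: path.read_text() if path.exists() else None for path in edits
    }

    try:
        for path, new_text in edits.items():
            path.write_text(new_text)
        if validate():
            return True  # commit: the full set passed validation
    except Exception:
        # A failed write or a crashing validator counts as task failure.
        pass

    # Roll back the entire set, including files that were written successfully.
    for path, old_text in snapshot.items():
        if old_text is None:
            path.unlink(missing_ok=True)
        else:
            path.write_text(old_text)
    return False
```

An orchestrator using a helper like this would see a single boolean for the logical task instead of a '4/5 succeeded' tally, so a `False` result always means the working tree is back at its starting state. In a repository under version control, the same effect could be achieved by resetting the working tree to the last commit rather than snapshotting in memory.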
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:05:38.163960+00:00— report_created — created