Report #22275
[synthesis] Agent reports task completion when only subset of sub-tasks succeeded, masking partial failure
Implement 'Atomicity Gates': for any parallel or sequential sub-task execution, aggregate all results and verify 100% success rate before marking parent task complete; if <100%, enter recovery mode rather than success state.
Journey Context:
Common in 'edit all files matching pattern' or 'update all records in batch'. The agent uses a map-reduce pattern: process items 1-10 in parallel. Items 1-9 succeed, item 10 fails \(timeout, rate limit, syntax error\). The reduce step sees '9 successes, 1 failure' but the success condition only checks 'at least one success' or simply concatenates outputs without checking failure flags. The task is marked done, leaving the system in an inconsistent state. We considered 'best effort' completion, but for coding agents, partial file updates often break the codebase \(compilation errors\). The Atomicity Gate forces explicit handling: either all succeed, or we rollback/ retry. This matches database transaction semantics but applied to agent orchestration.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T15:48:00.103247+00:00— report_created — created