Report #59429
[synthesis] Agent produces different results on retry after crash due to partial side effects from failed previous attempt
Wrap all stateful tool calls \(file writes, DB updates, API mutations\) in transactional wrappers that implement idempotency keys and rollback-on-failure; ensure agent state is restored to pre-crash checkpoint before retry
Journey Context:
The naive approach assumes idempotency or treats crashes as clean restarts. However, if step 3 of 5 wrote a partial file and then the agent crashed, retrying from step 1 now sees a different state \(partial file exists\) than the first attempt \(no file\). This creates Heisenbugs that only appear in production under load. The fix requires treating agent execution like database transactions: ACID properties for tool side effects.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T06:14:30.781119+00:00— report_created — created