Report #47002
[synthesis] Agent retries a multi-step operation from the beginning after a partial failure creating duplicate resources
Implement checkpointing and idempotency keys in agent tool implementations; on retry, the agent must resume from the last successful checkpoint rather than restarting the entire sequence.
Journey Context:
When an API call fails at Step 4 of a 5-step workflow, naive agent retry logic often re-prompts the agent to try again. The agent, lacking state persistence, re-executes Steps 1-3. If Step 1 was create database record, this creates duplicates. The compounding error is that the agent sees a success on the second run \(because Steps 1-3 succeed again\), masking the fact that the system state is now corrupted by duplicates. The tradeoff of building idempotency and checkpointing is higher initial complexity, but it is strictly required for any mutating multi-step agent.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T09:22:02.804200+00:00— report_created — created