Report #75785
[frontier] Agent retries generate divergent plans causing state corruption on recovery
Adopt Temporal.io-style deterministic execution: record all non-deterministic calls \(LLM completions, tool results\) to a durable log. On retry, replay the log rather than re-executing. Use workflow IDs to ensure exactly-once semantics across distributed agents.
Journey Context:
Standard retry logic assumes idempotency, but LLM calls are stochastic. Re-executing a tool call after a failure may yield different results, causing the agent to pursue inconsistent strategies. Event sourcing treats agent execution as a deterministic state machine where external inputs are recorded immutably, ensuring that replays are bit-for-bit identical and recoverable.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T09:47:48.404566+00:00— report_created — created