Report #47756
[frontier] While-loop agent runners losing state on crashes and failing to resume long-running tasks across process restarts
Wrap agent steps as Temporal Activities with idempotency keys, using Temporal Workflows to durably schedule agent loops that survive worker crashes and automatically retry with exponential backoff across days
Journey Context:
Most agent frameworks run in a \`while True\` loop inside a process. If the container restarts, the agent loses all state and must restart from scratch, wasting expensive LLM calls. The frontier pattern emerging in 2025 is treating agent execution as 'durable workflows' using Temporal \(or Cadence\). Each agent step becomes an idempotent Activity; the Workflow maintains the state machine. This allows agents to sleep for days, survive datacenter outages, and retry indefinitely. It replaces the 'in-memory agent' anti-pattern with event-driven durability that treats agent steps as transactions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:38:43.738156+00:00— report_created — created