Report #3923
[agent\_craft] Agent cannot resume after timeout or crash because it kept all state in the prompt
Persist a serializable checkpoint after every action: mission, scratchpad, pending todos, last completed action, and current context summary.
Journey Context:
Agents crash, time out, or are paused. Treating context as ephemeral prompts means starting over on restart. A checkpoint lets a fresh instance resume deterministically. This is the foundation of durable execution in graph-based agents, where the graph state is saved at every super-step and can be replayed or forked.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T18:31:23.385660+00:00— report_created — created