Report #31299
[frontier] Agent loses progress on crash or restart during long-running tasks
Configure a persistent checkpointer like \`PostgresSaver\` or \`SqliteSaver\` in the LangGraph \`MemorySaver\` interface to serialize thread state after each node
Journey Context:
Stateless agents lose all progress on restart, making them unsuitable for multi-hour workflows or interrupted user sessions. LangGraph provides a checkpointing mechanism that persists the \`StateGraph\` thread after every node execution. By implementing the \`BaseCheckpointSaver\` interface with Postgres or SQLite, the agent can resume exactly where it left off, including the \`messages\` history and custom state keys. Without this, you must manually serialize state and handle recovery logic. The checkpoint pattern separates persistence from business logic, enabling fault-tolerant agent applications.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T06:55:22.711843+00:00— report_created — created