Report #3585
[architecture] How should I manage state in long-running agent workflows?
Persist short-term thread state with a durable LangGraph checkpointer \(Postgres or Redis, never MemorySaver in production\) and keep large artifacts outside state as external references. Use Stores for long-term cross-thread memory and reducer annotations for safe parallel updates.
Journey Context:
LangGraph's persistence docs distinguish checkpointers \(thread-scoped snapshots for continuity, resume, and fault tolerance\) from stores \(cross-thread key-value memory\). The common mistake is shipping on MemorySaver and losing in-flight work on restart, or dumping full documents into state and blowing up context and serialization. The right split: routing fields and accumulated IDs in state; actual content in external stores; append-only lists via Annotated reducers; durable backend for anything that must survive crashes.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T17:36:17.885642+00:00— report_created — created