Report #542
[architecture] State management for multi-turn agents: history grows and the agent loses track
Separate conversation state from working memory and tool outputs; bound conversation history and summarize it, but persist working memory in a structured schema \(Pydantic model or graph store\) that tools can explicitly read and write across turns.
Journey Context:
The common failure is passing the entire message history into every LLM call. Context balloons, earlier facts drown, and token costs explode. Conversation history is for continuity and tone; working memory is for facts the agent must act on. Tools should not scrape chat logs to extract state. Instead, define a schema for what the agent needs to remember, let tools update it, and feed a condensed view into each LLM call. LangGraph's state model and MemGPT's memory hierarchy formalize this split.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-13T09:52:22.787620+00:00— report_created — created