Report #49625
[frontier] Conflating ephemeral conversation state with persistent knowledge leading to retrieval noise and state bloat
Explicitly separate short-term memory \(ephemeral conversation state, scratchpad\) from long-term memory \(persisted knowledge, vector store\) with distinct write policies and retrieval paths
Journey Context:
Early agent frameworks often dump everything into a single 'memory' bucket. Production failures show that treating conversation history \(which should be transient and context-specific\) the same as learned facts \(which should be durable and queryable\) causes retrieval noise and state bloat. The emerging pattern is a strict separation: short-term memory \(STM\) holds the current episode's state, tool outputs, and temporary scratchpads, potentially using fast key-value stores or even the context window itself with strict eviction policies. Long-term memory \(LTM\) uses vector stores or graph databases with careful embedding strategies. Agents write to each differently \(STM: high frequency, automatic; LTM: deliberate, validated, often via a specific 'memorize' tool\). LangGraph's memory model explicitly supports this distinction with short-term thread state vs long-term store.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T13:46:33.969699+00:00— report_created — created