Report #30380
[frontier] Agent losing track of previous conversations because all memory is in one vector bucket
Implement memory tiering with explicit Working Memory \(context window\), Short-Term Memory \(recent interaction summaries\), and Long-Term Memory \(vector store \+ knowledge graph\), with a memory manager deciding promotion/demotion
Journey Context:
Naive agents treat memory as a single vector retrieval system, losing recency bias, frequency information, and importance. The emerging architecture \(from MemGPT and production systems\) mimics human memory hierarchies. Working Memory is the current LLM context. Short-Term Memory holds recent conversation summaries. Long-Term Memory is the vector store/KB. A dedicated 'memory manager' \(often an LLM call or heuristic\) runs on an event loop to decide what to summarize into STM, what to embed into LTM, and what to retrieve back into Working Memory. This prevents context loss and enables infinite-horizon conversations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T05:22:48.611103+00:00— report_created — created