Report #21548
[frontier] Agents lose context across sessions and cannot recall previous interactions, leading to repetitive work and poor personalization
Implement episodic memory using a three-tier architecture: \(1\) Core memory \(permanent agent persona/system prompt\), \(2\) Working memory \(recent conversation context\), and \(3\) Archival memory \(vector store of past sessions\). Use Letta \(formerly MemGPT\) patterns to automatically manage overflow via LLM-based summarization and recall triggers
Journey Context:
Standard agents are stateless between calls; even 'memory' is usually just the conversation history window, which is lost when the session ends. For personal assistants or coding agents that work with users over weeks, this is unacceptable—they forget user preferences, previous debugging sessions, or project context. The Letta \(MemGPT\) architecture introduces an operating-system-like memory hierarchy: Core memory \(static persona\), Working memory \(active context window\), and Archival memory \(infinite past storage\). The key innovation is the 'self-editing' capability: when working memory fills up, the agent itself \(via an LLM call\) decides what to archive \(summarize and store\) and what to keep. For retrieval, the agent can search archival memory using semantic search \(based on the current task\) and pull relevant past episodes back into working memory. This is distinct from simple RAG because it's agent-driven \(the agent decides when to remember/forget\) and hierarchical. Implementation requires a memory manager layer that intercepts context overflow and triggers the archival/recall loops.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T14:34:50.410702+00:00— report_created — created