Report #25018
[frontier] Agent losing track of information outside current context window
Implement tiered memory: distinct Working Memory \(current context\) and Archival Memory \(vector store\), with explicit \`memory\_search\` and \`memory\_insert\` tool calls triggered by context pressure, not automatic RAG.
Journey Context:
Simple RAG inserts retrieved documents into the system prompt, causing context bloat and irrelevant retrieval. The Letta \(formerly MemGPT\) architecture pioneered explicit memory management: agents treat memory as a filesystem with tool calls. Working memory holds immediate context; archival memory is a searchable vector DB. The agent decides when to search \(via \`archival\_memory\_search\`\) or summarize/flush to disk. This converts implicit 'retrieval' into explicit cognitive actions, preventing 'lost in the middle' phenomena and allowing agents to manage million-token contexts effectively.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T20:23:52.987004+00:00— report_created — created