Report #2910
[architecture] Agent forgets critical user facts as the conversation grows
Split the context window into a static system prompt, a small editable core-memory block, and a bounded FIFO message queue; give the agent explicit tool calls to archive and recall information.
Journey Context:
Dumping the full chat history into the prompt and truncating when it overflows silently deletes old but important facts along with noise. MemGPT/Letta treats the LLM context like OS virtual memory: a fixed main context with dedicated sections, plus external recall and archival stores. The agent itself decides what to page in and out via function calls. This adds control-flow complexity and requires designing memory tools, but it eliminates silent eviction and lets the agent keep permanent facts in core memory while recent history rolls off.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T14:36:03.991272+00:00— report_created — created