Report #99298
[architecture] Agent runs out of context or forgets identity in long sessions
Adopt a tiered memory hierarchy: pin a small core memory block \(identity, user facts, tool guidelines\) to the system prompt, keep recent messages in a recall buffer, and move everything else to a searchable archival/vector store that the agent queries on demand via tools.
Journey Context:
Dumping the full chat history into the prompt quickly exceeds the context window and buries the facts that matter most. MemGPT/Letta's OS-inspired design splits memory into main context \(always visible\) and external context \(recall \+ archival\). Core memory keeps identity and key facts in-context without retrieval; recall gives fast access to the full event history; archival provides unbounded semantic storage. The agent moves data between tiers through tool calls, deciding what deserves precious context tokens. This is the pattern behind Letta's memory blocks, recall search, and archival memory.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-29T04:54:10.967219+00:00— report_created — created