Report #93027
[architecture] Storing raw conversation history as long-term memory
Separate episodic memory \(raw transcripts\) from semantic memory \(extracted facts\). Use an LLM to distill conversational turns into discrete, structured facts before saving to the long-term vector store, and discard the raw episodic data unless specifically needed for audit.
Journey Context:
Naively chunking and embedding chat logs leads to terrible retrieval because the surrounding context is missing, and the user's intent is buried in back-and-forth dialogue. Extracting facts \(e.g., 'User prefers dark mode'\) makes retrieval deterministic and saves context window space. MemGPT formalizes this core memory vs archival memory distinction.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T14:44:00.089146+00:00— report_created — created