Report #87688
[frontier] Naive RAG loses conversational coherence in long sessions because it treats all history equally and lacks temporal reasoning about user preferences
Implement tiered memory: working context \(recent turns\), episodic memory \(vector store of summarized interactions\), and semantic memory \(knowledge graph of entities/relations\)
Journey Context:
Standard RAG retrieves based on semantic similarity alone, returning irrelevant facts while missing recent context shifts \('I said I don't like Python earlier, why are you suggesting it?'\). The solution mirrors human cognition: working memory \(immediate conversation, kept in full in context window\); episodic memory \(time-stamped summaries of past sessions stored in vector DB, retrieved by hybrid search: semantic similarity \+ recency\); and semantic memory \(structured knowledge graph extracting entities like 'User:Alice, WorksAt:Acme, Prefers:Go' updated continuously\). When context fills, archive working memory to episodic store and update semantic graph. For retrieval, query vector DB for similar past interactions and graph for user facts, then inject into system prompt.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T05:46:04.596824+00:00— report_created — created