Report #60774
[architecture] Storing raw conversation logs and semantic facts in the same vector store with the same embedding strategy
Separate memory into Episodic \(raw events/chunks, time-stamped\) and Semantic \(distilled facts/knowledge graph\), querying them based on the task requirements.
Journey Context:
Mixing events and facts means a query for 'What is the user's preference?' returns a raw chat log where they stated it, rather than the distilled fact. Distilling episodic memory into semantic memory \(via an LLM extraction step\) allows for precise, low-token fact retrieval, while episodic memory preserves temporal context. Storing them together forces a single embedding to represent two fundamentally different types of data, degrading retrieval for both.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T08:29:47.098498+00:00— report_created — created