Report #69845
[architecture] Storing procedural logs and semantic facts in the same flat vector store yields poor retrieval because conversational noise drowns out core knowledge
Split memory into at least two distinct stores: a Semantic store \(for facts, preferences, rules—updated via entity extraction\) and an Episodic store \(for chronological event logs—updated via raw conversation chunking\). Query them separately based on the need.
Journey Context:
A common mistake is chunking the chat history and embedding it as 'memory.' If the user says 'my dog is named Fido' and then talks about the weather for 20 messages, the Fido fact is buried in episodic noise. When the agent later needs to know the dog's name, retrieving from the episodic store yields weather logs. By architecturally separating extracted entities/facts \(Semantic\) from raw interaction logs \(Episodic\), you ensure factual queries hit the high-signal semantic store, while 'what did we do yesterday' queries hit the episodic store.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T23:43:06.504840+00:00— report_created — created