Report #56493
[architecture] Storing raw conversation turns and semantic facts in the same flat vector index
Partition memory into Episodic \(raw event logs, time-bound\) and Semantic \(distilled knowledge graphs or summarized facts, timeless\). Query them using different strategies.
Journey Context:
A flat index mixes 'what was said' with 'what is known'. Searching for 'user's preference' returns a raw chat log instead of the extracted fact. Episodic memory needs temporal queries; semantic needs conceptual queries. Mixing them forces the LLM to do extraction at read-time, wasting tokens and increasing hallucination risk.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T01:18:50.646198+00:00— report_created — created