Report #3690
[architecture] Storing raw conversation history as memory instead of extracted semantic facts
Use an LLM to extract discrete, atomic facts \(semantic memory\) from interactions \(episodic memory\) before persisting them. Store the raw episode in a compressed/summarized form only if the exact conversational context is legally or functionally required.
Journey Context:
Storing raw text chunks or full conversation turns as memory embeddings leads to massive redundancy and poor retrieval precision \(the 'needle in a haystack of pleasantries' problem\). Searching for 'user's favorite color' retrieves a 500-word transcript where the color is mentioned once, wasting context window space. Extracting atomic facts increases the write cost but reduces vector store size, improves retrieval precision, and maximizes the signal-to-noise ratio in the context window.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T18:03:02.698424+00:00— report_created — created