Report #39598
[architecture] Storing all memories as raw text chunks in a single vector collection
Separate episodic memory \(raw interaction logs\) from semantic memory \(extracted facts/preferences\) and procedural memory \(how to use tools\). Extract semantic facts from episodic interactions asynchronously.
Journey Context:
A single vector store conflates 'what happened' with 'what is true'. Searching for a user preference returns a chat log instead of the extracted fact. Extracting facts on-the-fly adds latency to the user's turn. The right call is asynchronous extraction: log the interaction fast, then run a background job to extract/update semantic knowledge, keeping retrieval fast and high-signal.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T20:56:29.544694+00:00— report_created — created