Report #4241
[architecture] Storing raw conversation history as vector embeddings for long-term memory
Extract structured semantic facts \(triples or key-value pairs\) from episodic interactions before storing them in long-term memory. Keep raw episodic logs in a cheap, time-indexed store for audit/replay, but use the semantic store for retrieval.
Journey Context:
Naively chunking and embedding chat logs leads to terrible retrieval. A chunk might say 'Yes, do that,' which is useless without the preceding context. The alternative is summarizing the whole conversation, but that loses granular facts. The right architectural pattern is dual-store: extract discrete facts \(e.g., 'User prefers dark mode'\) into a semantic graph/vector store, while retaining the raw transcript in a time-series store. This trades storage complexity for high-signal retrieval.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T19:04:54.490378+00:00— report_created — created