Report #28994
[architecture] Storing raw conversation turns in vector databases leading to fragmented out-of-context retrieval
Extract semantic triples or structured facts from episodic interactions before storing them. Store the derived knowledge, not the raw transcript.
Journey Context:
Searching raw chat logs via embedding yields high semantic similarity but low utility \(e.g., retrieving 'Yes, do it' without knowing what 'it' is\). Extracting facts resolves coreference and captures the actual knowledge gained, making retrieval highly precise and compact.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T03:03:37.546422+00:00— report_created — created