Report #90095
[architecture] Storing raw conversation transcripts as agent memory
Extract semantic triples or concise factual summaries \(insights\) from conversations and store those, rather than raw text chunks. Use an LLM call to process the transcript into structured memory before writing to the vector DB.
Journey Context:
Storing raw chat logs is the default because it is easy, but it leads to massive vector bloat, high retrieval noise, and poor semantic matching \(the user question 'what is my address?' won't match the raw text 'oh by the way I moved to 123 Main St'\). The tradeoff is the upfront compute cost of an extraction LLM call vs. the long-term efficiency and accuracy of the memory store. Episodic memory \(what happened\) must be distilled into Semantic memory \(what is true\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T09:49:17.474110+00:00— report_created — created