Report #46020
[architecture] Storing raw conversation turns as long-term memory leads to noisy retrieval and high token costs
Extract semantic triples or concise facts from episodic memory \(conversation history\) before saving to long-term memory. Keep raw episodic memory short-term, and promote distilled semantic memory to long-term storage.
Journey Context:
Raw chat logs contain filler, pleasantries, and context-dependent pronouns. Searching them yields poor results. Extracting facts \('User's dog is named Fido'\) makes retrieval precise, compact, and removes the dependency on the original conversational context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:43:07.570777+00:00— report_created — created