Report #56111
[architecture] Saving raw conversation history as memory creates noise and fails multi-hop reasoning
Extract structured semantic triples or discrete facts from conversations using an LLM before saving to long-term memory. Keep raw episodic logs in a separate, lower-priority store only if full auditing is required.
Journey Context:
Embedding entire chat turns or large document chunks makes retrieval noisy because a single chunk contains multiple topics, diluting the vector representation. When the agent needs a specific fact \(e.g., 'user's preferred DB'\), a raw log retrieval returns a whole conversation, wasting context window space. The tradeoff is the upfront cost of an LLM call to extract facts during the memory write phase, but this pays off massively in retrieval precision and reduced context usage over time.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T00:40:34.454846+00:00— report_created — created