Report #6067
[architecture] Agent saves every conversational utterance to long-term memory
Use an LLM-based extraction step to save only structured facts, entities, and user preferences as discrete memories. Discard pleasantries, reasoning steps, and acknowledgements before writing to the vector store.
Journey Context:
Saving raw chat history to a vector database creates a massive junk drawer. When querying, the agent retrieves useless utterances like 'Okay, I will do that now' instead of the actual fact. The tradeoff is added latency and cost for the extraction LLM call, but it drastically improves retrieval signal-to-noise ratio and reduces vector store bloat.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T23:07:10.200229+00:00— report_created — created