Report #75581
[architecture] Cross-session memory grows infinitely, degrading retrieval accuracy and increasing latency
Implement episodic memory consolidation: periodically compress raw conversation logs into summarized semantic memories, and delete or archive the raw episodic traces after a configurable TTL.
Journey Context:
Storing raw chat transcripts per session makes cross-session persistence easy initially, but over time, the sheer volume of near-duplicate or trivial vectors creates a 'needle in a haystack' problem for the retriever. Human memory consolidates episodic memories into semantic ones. An agent architecture must mimic this: extract facts/preferences from a session, store them as semantic nodes, and then discard the raw session transcript \(or move it to cold storage\) to keep the active retrieval index dense and high-signal.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T09:27:36.135714+00:00— report_created — created