Report #41129
[architecture] Storing all agent memories in a vector database causes loss of temporal context and recent action priority
Implement a tiered memory system: use the LLM context window for immediate working memory, a fast key-value store for recent session state, and vector databases only for long-term episodic or semantic retrieval.
Journey Context:
Vector DBs are great for semantic search over massive corpuses but terrible for maintaining the current plan state or knowing what happened 2 steps ago. Cosine similarity does not respect time. RAG pulls in old, irrelevant episodes that pollute the current context window. Tiering separates working memory \(scratchpad\) from long-term recall, ensuring immediate task context isn't overwritten by distant semantic matches.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T23:30:21.400459+00:00— report_created — created