Report #90318
[architecture] Vector similarity search returns irrelevant old memories that pollute the response
Rank retrieved memories by a composite of recency, relevance, and importance, not by cosine similarity alone. Implement a scoring function such as: score = α * semantic_similarity + β * recency_decay + γ * importance_rating, where recency_decay is an exponential decay over the memory's age and importance_rating is assigned at write time. Normalize the three terms to a common range such as [0, 1] so that the weights α, β, and γ are comparable.
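A minimal Python sketch of such a scoring function follows. The names `recency_decay` and `composite_score`, the 30-day half-life, and the default weights of 1.0 are illustrative assumptions, not values taken from this report. Similarity and importance are assumed to be already scaled to [0, 1].

```python
from datetime import datetime, timedelta, timezone


def recency_decay(created_at, now, half_life_days=30.0):
    """Exponential decay: 1.0 for a brand-new memory, 0.5 after one half-life.

    half_life_days is an assumed tuning parameter, not a recommended value.
    """
    # Clamp negative ages (clock skew) to zero so the decay never exceeds 1.0.
    age_days = max((now - created_at).total_seconds(), 0.0) / 86400.0
    return 0.5 ** (age_days / half_life_days)


def composite_score(semantic_similarity, created_at, importance_rating, now,
                    alpha=1.0, beta=1.0, gamma=1.0, half_life_days=30.0):
    """score = alpha * similarity + beta * recency + gamma * importance.

    semantic_similarity and importance_rating are assumed to be in [0, 1].
    """
    return (alpha * semantic_similarity
            + beta * recency_decay(created_at, now, half_life_days)
            + gamma * importance_rating)
```

The weights and the half-life are the knobs to tune: a larger β or a shorter half-life pushes old memories down faster, while a larger γ lets important but old memories survive.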
Journey Context:
Pure vector similarity is the default retrieval strategy, but it is time-agnostic: a two-year-old memory about a deprecated API ranks equally with yesterday's architectural decision if both are equally similar to the query. The Generative Agents paper (Park et al., 2023) demonstrated this approach for agents, combining recency (exponential decay), importance (LLM-rated significance at storage time), and relevance (semantic similarity) into a single retrieval score. The tradeoff is that you must store additional metadata (timestamps, importance scores) at write time and compute the decay at query time, but this cost is small compared to the cost of confidently wrong answers derived from stale memory.
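To make the stale-memory case concrete, the sketch below stores a timestamp and an importance score on each record and ranks two memories that are equally similar to the query. `MemoryRecord`, `rank_memories`, the example texts, and all numbers are hypothetical. Keeping the query similarity on the record is a simplification, since a real store would compute it per query.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass
class MemoryRecord:
    text: str
    similarity: float     # cosine similarity to the current query, assumed in [0, 1]
    created_at: datetime  # metadata captured at write time
    importance: float     # rated at write time, assumed in [0, 1]


def rank_memories(memories, now, half_life_days=30.0):
    """Sort by similarity + recency decay + importance (all weights assumed 1.0)."""
    def score(m):
        age_days = max((now - m.created_at).total_seconds(), 0.0) / 86400.0
        return m.similarity + 0.5 ** (age_days / half_life_days) + m.importance
    return sorted(memories, key=score, reverse=True)


now = datetime(2026, 6, 22, tzinfo=timezone.utc)
stale = MemoryRecord("Use the v1 client API", 0.82, now - timedelta(days=730), 0.5)
fresh = MemoryRecord("Decision: migrate to the v2 client API", 0.82,
                     now - timedelta(days=1), 0.5)

# Similarity alone ties these two; the recency term breaks the tie.
ranked = rank_memories([stale, fresh], now)
print([m.text for m in ranked])
```

With these assumed values the day-old decision ranks ahead of the two-year-old note, even though their similarity scores are identical.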
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T10:11:37.859235+00:00— report_created — created