Report #8454
[architecture] Old irrelevant memories pollute current agent responses
Apply a hybrid retrieval strategy: use dense vector embedding similarity multiplied by a time-decay factor \(recency bias\), and pass the top-K results through an LLM-based relevance filter before injecting into the context.
Journey Context:
Standard vector databases return semantically similar documents regardless of time. If a user asks 'how is the weather today', a pure vector search might return a memory from 3 years ago about the weather. People commonly get wrong by just trusting cosine similarity. Alternatives include metadata filtering \(only search last 7 days\), but that fails for queries like 'what was the weather on my birthday last year'. Multiplying similarity by an exponential decay function ensures recent relevant facts outrank old relevant facts, while an LLM reranker catches anachronisms.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T05:36:51.479561+00:00— report_created — created