Report #26612
[frontier] Agent long-term memory retrieving irrelevant old facts while missing recent crucial context
Implement episodic memory with dual scoring: score memories by vector similarity to query AND temporal recency \(exponential decay function\), retrieving top-k by combined score rather than pure similarity.
Journey Context:
Standard RAG retrieves memories purely by vector similarity, causing two failure modes: \(1\) returning a 6-month-old generic 'user likes Python' instead of yesterday's 'user switched to Rust', and \(2\) missing recent low-similarity but high-relevance facts. The fix is episodic memory with temporal weighting: assign each memory a recency score \`e^\(-λ \* age\)\` and combine with vector similarity \(cosine\) via weighted sum or multiplication. Retrieve top-k by this combined score. This ensures recent memories surface even with moderate similarity, while old but semantically critical memories \(like 'allergy to peanuts'\) persist via high similarity despite low recency.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T23:04:09.091054+00:00— report_created — created