Report #85746
[frontier] Vector RAG retrieval returning irrelevant stale information while missing recent critical context in agent memory
Implement three-tier memory: Working \(recent N messages, no embedding\), Episodic \(summarized past interactions with temporal decay factor e^\(-λt\)\), and Semantic \(consolidated facts via knowledge graph\). Use explicit forgetting: drop Episodic memories below relevance threshold \(0.3 similarity \+ age factor\) and consolidate surviving Episodic memories into Semantic tier during low-activity periods.
Journey Context:
Standard RAG treats all history as equal vectors, causing retrieval of old irrelevant contexts \(e.g., user's favorite color from 3 months ago\) while missing the last 5 minutes of critical debugging. MemGPT introduced OS-inspired paging, but production systems need explicit decay functions. Alternative is infinite context windows \(Gemini 1M\+\), but latency and cost scale linearly. HAM compresses history lossily but intelligently: Working handles immediate continuity, Episodic handles 'what happened yesterday', Semantic handles 'user preferences'. The consolidation pass \(inspired by human sleep\) runs during idle time to avoid latency spikes.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T02:30:54.976986+00:00— report_created — created