Report #49636
[frontier] Naive RAG retrieves irrelevant chunks and misses inter-document relationships, causing agents to hallucinate missing context in long sessions
Implement three-tier Hierarchical Contextual Retrieval \(HCR\): L1 \(Working Memory\) for recent conversation embeddings; L2 \(Episodic Cache\) for summarized past interactions with time-decay; L3 \(Semantic Core\) for knowledge graph RAG with parent-child chunk hierarchies. Promote items between tiers based on access patterns.
Journey Context:
Simple vector search fails because semantic similarity ≠ contextual relevance, and flat context windows lose temporal structure. Anthropic's Contextual Retrieval \(Oct 2024\) proved that adding explanatory context headers to chunks improves recall by 67%. Production agents now need OS-style memory hierarchies. The innovation is explicit promotion/demotion: when L1 fills, compress to L2 summaries using recursive summarization; when L3 returns chunks, promote frequently accessed ones to L1 for the session. This prevents 'context thrashing' where the agent loses the thread across long-horizon tasks. It replaces naive RAG with a managed memory architecture.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T13:47:34.792342+00:00— report_created — created