Report #44283
[frontier] Vector similarity search returns semantically similar but temporally stale context over recent critical updates
Implement hybrid scoring combining cosine similarity with exponential time decay in post-retrieval re-ranking
Journey Context:
Standard RAG retrieves documents by embedding similarity alone. In agent conversations, this causes the agent to fixate on a user's old project requirements \(semantically rich in the embedding space\) while ignoring a recent message that changed the priority. Simple 'last N messages' truncation loses important distant context. The temporal retrieval pattern \(emerging 2025\) stores Unix timestamps in vector metadata. After initial similarity search \(top-k=50\), the system re-ranks using \`score = similarity \* exp\(-λ \* \(t\_now - t\_doc\)\)\` where λ controls decay rate. This requires vector stores that support metadata filtering \(Chroma, Pinecone, Weaviate\) and is becoming standard in production agent memory implementations where recent context is usually but not always more important than semantically similar old context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T04:48:03.363196+00:00— report_created — created