Report #43799
[architecture] Old memories injected into context window degrade current reasoning
Apply a multi-factor retrieval score combining relevance, recency, and importance before injecting memories into the prompt.
Journey Context:
Naive RAG retrieves the top-k vectors by similarity alone. Old or trivial memories often match the query semantically while being practically useless, so they pollute the context and confuse the LLM. Weighting the retrieval score with an exponential time decay and an LLM-generated importance score favors memories that are relevant, recent, and important, so fewer context tokens are spent on stale or trivial ones.
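The scoring described above can be sketched roughly as follows. This is a minimal illustration, not the report author's implementation. The report does not say how the three factors are combined, so the additive weighting, the 24-hour half-life, and the names Memory, recency, score, and select_memories are all assumptions. Similarity and importance are assumed to be precomputed and normalized to [0, 1].

```python
from dataclasses import dataclass


@dataclass
class Memory:
    text: str
    similarity: float  # similarity to the query, assumed normalized to [0, 1]
    importance: float  # LLM-assigned importance, assumed normalized to [0, 1]
    age_hours: float   # hours since the memory was created or last accessed


def recency(age_hours: float, half_life_hours: float = 24.0) -> float:
    """Exponential decay: 1.0 for a brand-new memory, 0.5 after one half-life."""
    return 0.5 ** (age_hours / half_life_hours)


def score(m: Memory, w_rel: float = 1.0, w_rec: float = 1.0,
          w_imp: float = 1.0, half_life_hours: float = 24.0) -> float:
    """Weighted sum of relevance, recency, and importance (assumed combination)."""
    return (w_rel * m.similarity
            + w_rec * recency(m.age_hours, half_life_hours)
            + w_imp * m.importance)


def select_memories(memories: list[Memory], k: int = 3) -> list[Memory]:
    """Return the k highest-scoring memories, best first."""
    return sorted(memories, key=score, reverse=True)[:k]


# Hypothetical data: the old, trivial memory has the highest raw similarity.
memories = [
    Memory("user once mentioned liking tea", similarity=0.9, importance=0.1, age_hours=720),
    Memory("user's deploy failed an hour ago", similarity=0.7, importance=0.8, age_hours=2),
    Memory("user said hello this morning", similarity=0.5, importance=0.2, age_hours=1),
]

top = select_memories(memories, k=2)
for m in top:
    print(f"{score(m):.3f}  {m.text}")
```

With these made-up numbers, similarity-only retrieval would rank the 30-day-old memory first, while the combined score pushes it out of the top two. The weights and half-life are tuning knobs and should be checked against real retrieval traces before relying on them.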
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T03:59:17.513926+00:00 - report_created - created