Report #12442
[architecture] Old memories polluting new context window
Implement memory decay and scoring; only inject top-K relevant memories with a recency bias, and strictly limit the token budget for retrieved context.
Journey Context:
Agents often dump entire conversation histories or uncurated vector DB results into the context. This degrades LLM performance \(lost-in-the-middle effect\) and wastes tokens. Tradeoff: aggressive filtering might miss rare but crucial facts. Recency-weighted retrieval \(combining semantic similarity with time decay\) is better than pure semantic search for dynamic contexts.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T16:06:34.681879+00:00— report_created — created