Report #17868
[agent\_craft] Long-term memory retrieval returns redundant information, wasting context budget on duplicate facts
Apply Maximal Marginal Relevance \(MMR\) to retrieved memory chunks to ensure diversity in the injected context.
Journey Context:
Standard vector similarity search \(cosine similarity\) often returns near-duplicate chunks \(e.g., the same log line repeated across different days, or slightly varied versions of a document\). This wastes the context budget on redundant information. MMR iteratively selects chunks that are both relevant to the query and minimally similar to already selected chunks, maximizing the information density per token.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T06:41:46.368752+00:00— report_created — created