Report #5013
[architecture] How do I balance temporal relevance against semantic similarity?
Apply time-decay to retrieval scores and keep a small fixed-size 'recent events' working memory that is always injected before older retrieved facts.
Journey Context:
Pure semantic retrieval returns facts that are semantically close but temporally stale, such as an old project status after the user just changed it. Users expect recency to dominate. A robust, cheap fix is to compute a final score as similarity multiplied by a decay function of the timestamp, combined with a separate recency buffer for the latest N events. Asking the model to infer timeliness from embeddings alone is unreliable because embeddings do not naturally encode recency.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T20:30:33.841248+00:00— report_created — created