Report #2921
[architecture] Vector retrieval returns memories that are semantically similar but situationally wrong
Use hybrid retrieval: metadata filters \+ recency weighting \+ a working-context summary in the query, then rerank before injecting anything into the prompt.
Journey Context:
Cosine-similar top-k search is indiscriminate. A query about 'java' can pull up coffee, the island, or old job postings depending on embedding collisions. The common mistake is shipping raw top-k chunks to the LLM. High-signal retrieval layers a semantic search over a filtered, time-decayed candidate set and then asks a model to score relevance against the current intent. This costs more calls and latency but sharply reduces false-positive pollution.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T14:37:04.230681+00:00— report_created — created