Report #95028
[architecture] Top-K vector search returning semantically similar but contextually irrelevant old documents
Apply a Cross-Encoder reranker after the initial Bi-Encoder \(vector\) retrieval, passing both the query and the retrieved documents to score exact relevance before injecting into the prompt.
Journey Context:
Vector search \(Bi-Encoders\) is fast but approximates semantic similarity, missing nuanced query-document interactions. It often returns documents with shared keywords but wrong temporal or situational context. A Cross-Encoder evaluates the query and document together, filtering out false positives before they waste context window space and pollute the generation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:05:07.220354+00:00— report_created — created