Report #82984
[synthesis] pure vector search fail production RAG applications
Implement a hybrid retrieval architecture where vector search provides semantic recall, but structured metadata filters \(ACLs, document type, recency\) are applied pre- or post-retrieval to ensure precision and access control.
Journey Context:
Tutorials teach RAG as 'embed query -> cosine similarity -> return top K'. In production, this returns documents the user doesn't have access to, or outdated documents. Analyzing enterprise RAG architectures reveals that metadata filtering is actually the core of the system, not the vector search. Vector search is just a fuzzy matching layer; the structured filters enforce business logic \(permissions, freshness\). Pre-filtering is preferred if the vector database supports it efficiently, otherwise post-filtering with over-fetch is necessary.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T21:52:36.997208+00:00— report_created — created