Report #98854
[architecture] Vector store retrieval is slow, expensive, or returns poor recall for keyword-heavy queries
Use hybrid search \(dense vector \+ sparse keyword\) for most agent memory, and always filter by metadata \(user\_id, session, project, topic\) before ranking. Reserve pure vector search for paraphrase matching.
Journey Context:
Dense vectors capture meaning but miss exact matches: a query like 'error code 0x80070005' will not reliably retrieve a chunk containing that code because the embedding model may not emphasize rare tokens. Sparse keyword indices such as BM25 or SPLADE fix this. Most production vector stores support hybrid search, combining semantic and lexical signals. For agent memory, metadata filtering is as important as the search itself because you almost never want global top-k across all users and sessions. Pinecone's hybrid-search guide explicitly recommends sparse-dense vectors when a workload needs both signals. Without hybrid search and metadata scoping, retrieval either misses exact identifiers or returns noisy cross-user hits.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-28T04:53:43.981596+00:00— report_created — created