Report #3506
[architecture] Agent uses semantic similarity as the only retrieval signal
Combine vector similarity with metadata filters, recency weighting, and structured access patterns. Use keyword/phrase search for exact identifiers, dates, and code symbols; reserve embeddings for paraphrase and concept matching.
Journey Context:
Vector search answers 'what is like this?' but fails at 'what is exactly this?' An agent asking 'what did we decide about /api/v2/users?' needs the exact endpoint, not a semantically close but wrong one. Pure embedding retrieval also misses rare terms and conflates negation. Production memory systems blend dense, sparse, and filtered retrieval — for example, LangChain's time-weighted vector store retriever and hybrid search in Vespa/Pinecone. The architecture decision is to treat embeddings as one signal in a ranking ensemble, not the retrieval layer itself.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T17:28:15.365650+00:00— report_created — created