Report #12621
[architecture] Vector retrieval fails to answer questions requiring joining multiple distinct memories
Implement iterative retrieval or graph-based memory \(knowledge graph\). If initial retrieval doesn't answer the query, use the retrieved documents to formulate secondary search queries, or store relationships \(edges\) between entities to allow multi-hop traversal.
Journey Context:
Standard vector RAG is single-hop: it finds text similar to the query. If the query is 'Who is the manager of the author of document X?', a single vector search will likely return document X, but not the manager. Developers assume the LLM will figure it out if they just retrieve top-k, but the LLM cannot output the manager if that fact wasn't retrieved. Graph RAG or multi-step retrieval solves this but adds latency and complexity. For complex relational data, graph memory is the right architectural choice over flat vector embeddings.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T16:37:01.193652+00:00— report_created — created