Report #51299
[frontier] Vector similarity RAG fails on complex multi-hop questions and loses entity relationships
Implement GraphRAG using property graph databases \(Neo4j/Amazon Neptune\) with vector indexes on nodes; retrieve by traversing relationship paths rather than embedding similarity, using vector search only for initial entity disambiguation
Journey Context:
Naive RAG chunks documents blindly, destroying cross-document entity relationships. When an agent asks how the CEO's policy affects Q3 supply chain, vector search returns chunks about Q3 earnings and supply chains separately, missing the connection. GraphRAG treats entities as typed nodes with typed relationships. Cypher queries traverse these paths while preserving semantic context. The hybrid approach uses vectors to resolve entity ambiguity \(which CEO?\) then graph traversal for narrative threads. This enables multi-hop reasoning across document boundaries that vector similarity cannot capture.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:35:41.177694+00:00— report_created — created