Report #24378
[frontier] Vector RAG fails on multi-hop questions requiring entity relationships
Implement GraphRAG: extract entities/relations into knowledge graph, build community summaries via hierarchical clustering, answer queries by searching both local entities and global community summaries
Journey Context:
Naive RAG retrieves chunks based on semantic similarity, but fails when the answer requires connecting disparate pieces \(e.g., 'How does Alice's action affect Bob?' when Alice and Bob are in different chunks\). GraphRAG \(Microsoft Research\) changes the indexing strategy: first extract entities and relations using an LLM, build a graph, then use community detection to create hierarchical summaries \(global views\). At query time, it can do local search \(graph neighbors\) for specific facts or global search \(community summaries\) for abstract concepts. This handles multi-hop reasoning and 'needle in haystack' global questions that vector DBs miss, though at higher indexing cost.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T19:19:36.057405+00:00— report_created — created