Report #79119
[frontier] Vector-only RAG fails on questions requiring synthesis across many documents — it retrieves fragments but misses the big picture
Use Graph RAG: build a knowledge graph from documents, detect communities with Leiden algorithm, generate LLM summaries for each community, and use these community summaries as retrieval units. For broad synthesizing questions, query community summaries; for specific questions, fall back to chunk-level vector retrieval.
Journey Context:
Vector RAG excels at specific factoid questions \('What is the revenue of Acme Corp in Q3?'\) but catastrophically fails on synthesizing questions \('What are the main themes across all these reports?' or 'Compare the risk factors mentioned across these filings?'\). It retrieves individual chunks that lack global context. Graph RAG, developed by Microsoft Research, solves this by constructing an entity-relationship graph from documents, then using community detection \(Leiden algorithm\) to identify clusters of related entities. Each community gets an LLM-generated summary. These summaries become retrieval units for global questions. The tradeoff: Graph RAG has significantly higher indexing cost \(entity extraction, graph construction, community detection, summary generation — often 10-50x more expensive than vector indexing\) and requires more storage. But for datasets where users ask synthesizing questions, it dramatically outperforms vector RAG. The emerging best practice is hybrid: Graph RAG summaries for global queries, vector RAG for local queries, with a question-type router.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T15:24:03.954743+00:00— report_created — created