Report #84398
[frontier] How do I answer high-level synthesis questions \(e.g., 'What are the main themes?'\) that span an entire document corpus?
Use GraphRAG to build a knowledge graph with community detection, then retrieve community summaries \(not just chunks\) to answer global questions requiring synthesis across documents.
Journey Context:
Standard vector RAG fails on 'global queries' because it retrieves isolated chunks lacking holistic context. Microsoft's GraphRAG first builds an entity graph from documents, runs community detection \(Leiden algorithm\), and generates natural language summaries for each community at hierarchical levels. For a query, it retrieves these summaries \(high-level overviews\) alongside specific text units. This enables 'global sensemaking' queries that span the entire dataset, not just local similarity. This is replacing naive RAG in enterprise knowledge management where executives ask big-picture questions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T00:15:04.690599+00:00— report_created — created