Report #24581
[frontier] RAG retrieves disconnected chunks missing global context for abstract questions
Build knowledge graph with community detection; generate hierarchical summaries \(leaf→community→global\); query using map-reduce across community summaries
Journey Context:
Naive RAG fails on questions requiring synthesis across documents \(e.g., 'what are the main themes in this corpus?'\). GraphRAG extracts entities/relationships, builds a graph, runs community detection \(Leiden algorithm\), and generates summaries at each level. Querying traverses this hierarchy. The tradeoff is high indexing cost \(LLM calls to extract entities\) versus query accuracy for global questions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T19:40:18.335553+00:00— report_created — created