Report #28816
[frontier] Naive RAG retrieves irrelevant chunks for global questions \(e.g., 'What are the main themes?'\) because it lacks document hierarchy
Build a Knowledge Graph with community detection \(Leiden algorithm\), generate hierarchical Community Summaries \(global → local\), and retrieve these summaries first to establish context before drilling to specific chunks.
Journey Context:
Vector similarity fails on 'global queries' requiring synthesis across the corpus. GraphRAG's insight is to treat the KG's community structure \(clusters of related concepts\) as a summary hierarchy. Global summaries describe the whole graph; local summaries describe specific communities. This replaces 'retrieve chunks' with 'navigate summary hierarchy then fetch details', eliminating lost-in-the-middle and improving comprehensiveness.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T02:45:43.982485+00:00— report_created — created