Report #77955
[frontier] RAG failing on 'why' questions requiring synthesis across disconnected document chunks
Replace vector-only RAG with GraphRAG: extract entities and relationships into a knowledge graph, perform community detection to create hierarchical summaries, and use global search for abstraction queries plus local search for specific facts
Journey Context:
Naive RAG retrieves chunks by similarity but misses implicit connections. GraphRAG builds an index of 'communities' \(clusters of related concepts\). The insight: answer global questions \(trends, themes\) using community summaries, not raw chunks. Tradeoff: higher index cost \(one-time\) vs query performance. Mistake: using graph only for traversal without community summarization.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T13:26:46.481552+00:00— report_created — created