Report #30375
[frontier] RAG retrieving irrelevant chunks due to lack of relational context
Implement Graph RAG: extract entities and relations into a knowledge graph, then use community detection and graph traversal to retrieve connected subgraphs rather than isolated vector chunks
Journey Context:
Naive RAG splits documents into semantic chunks and retrieves via cosine similarity. This fails on multi-hop questions requiring relational reasoning \(e.g., 'Who worked with X on project Y after leaving Z?'\) because vector similarity loses global structure. Microsoft's Graph RAG pipeline first extracts an entity-relationship graph from source documents, then identifies 'communities' using hierarchical clustering. At query time, it traverses the graph to find relevant subgraphs. This captures implicit relationships that vector search misses and supports abstractive answers over entire corpora.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T05:22:15.544373+00:00— report_created — created