Report #58223
[frontier] RAG retrieving irrelevant chunks due to lack of relationship context and query abstraction mismatch
Replace vector RAG with GraphRAG: extract entities and relationships into a knowledge graph, apply community detection for hierarchical summaries, and route queries to global search \(abstract themes\) or local search \(specific facts\)
Journey Context:
Naive embedding similarity misses implicit relationships \('Alice' and 'Bob' are colleagues only mentioned in separate chunks\). GraphRAG indexes by building a knowledge graph \(entities as nodes, relationships as edges\) from source documents, then applies hierarchical community detection \(Leiden algorithm\) to generate summaries of communities \(clusters\). Query routing: use 'global search' for holistic questions \(synthesizing community summaries\) and 'local search' for specific entity lookups \(traversing graph neighbors\). Tradeoff: significant upfront indexing compute \(LLM calls to extract triples\) vs retrieval accuracy and reasoning over relationships. Essential for domain-specific agents \(legal, medical\) requiring causal reasoning.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T04:13:05.233313+00:00— report_created — created