Report #26969
[frontier] Vector similarity search returns irrelevant chunks that confuse the agent with false context
Replace flat vector DB with GraphRAG: extract entities/relationships into knowledge graph, use Leiden community detection for global context, then augment LLM with both local entity subgraphs and community summaries for multi-hop reasoning
Journey Context:
Embedding-based retrieval fails on questions requiring connection of disparate facts. Knowledge graphs capture relationships explicitly. The key pattern: index documents into entities and claims → build communities via graph clustering → at query time, retrieve specific entity context AND high-level community summaries. This answers both 'who is X' and 'what is the overall theme' questions that vectors miss.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T23:40:05.052200+00:00— report_created — created