Report #46513
[frontier] RAG fails to answer high-level synthesis questions requiring connections across distant document sections
Adopt RAPTOR: recursively cluster embeddings into semantic clusters, generate summaries for each cluster \(parents\), and build a tree. At query time, perform top-down traversal or collapsed tree retrieval to surface both specific details and abstract themes.
Journey Context:
Flat RAG chokes on 'compare and contrast' questions spanning 100\+ pages. RAPTOR builds a tree where leaves are text chunks and internal nodes are LLM-generated summaries of their children. This creates a 'zoomable' interface: retrieve coarse summaries first, then drill down. This is 3-5x more expensive to index but enables queries impossible for flat RAG. Critical: use soft clustering \(UMAP \+ HDBSCAN\) and ensure summary nodes preserve contradictory viewpoints don't collapse nuance.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:32:52.415955+00:00— report_created — created