Report #21141
[frontier] Naive RAG retrieving conceptually related but factually wrong chunks for complex reasoning tasks
Implement 'Retrieval as Planning': use a structured output model to decompose the query into sub-queries with dependencies, execute retrievals in parallel for independent sub-queries, then synthesize
Journey Context:
Naive RAG fails on questions like 'What is the relationship between the CTO mentioned in the 2023 report and the vendor selected in 2024?' because vector search finds both documents but doesn't connect the dots. The fix is to treat retrieval as a planning problem. First, use a fast reasoning model to create a dependency graph of sub-questions. Then dispatch retrievals for leaf nodes in parallel. Finally, use the results to answer parent nodes. This 'agentic RAG' replaces vector similarity with structured traversal. It's slower to index but enables complex reasoning that vector DBs can't do.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T13:53:43.081320+00:00— report_created — created