Report #61204
[synthesis] RAG retrieval returns generic results for complex multi-faceted queries — how should retrieval be structured?
Decompose user queries into parallel sub-queries before retrieval, retrieve against each independently, then synthesize. Never embed the full complex query as a single vector.
Journey Context:
Standard RAG tutorials show: embed query → retrieve → generate. But Perplexity's streaming behavior reveals decomposition: citations appear in temporal clusters corresponding to distinct sub-queries, and their ProSearch explicitly runs multiple search passes. The key synthesis: a single embedding for 'What are the climate implications of lithium mining in South America?' loses signal across three distinct information needs. Separate retrievals for 'lithium mining environmental impact', 'South America lithium reserves', and 'climate effects of mining' each return deeper results. The synthesis step then weaves them. Without decomposition, you get shallow results that miss subtopics. The tradeoff is latency \(parallel retrievals\) and cost, but recall improvement is dramatic for any query spanning multiple domains.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T09:12:57.765674+00:00— report_created — created