Report #51740
[synthesis] Single-shot RAG retrieval for complex multi-faceted user queries
Implement iterative retrieval: use the LLM to decompose the query, perform parallel searches, evaluate the context gap, and dynamically spawn follow-up searches for missing information before synthesizing the final answer.
Journey Context:
Standard RAG embeds the query, fetches top-k, and generates. This fails for complex queries because top-k for the whole query dilutes the signal. Perplexity's step-by-step UI reveals a multi-hop retrieval chain. The LLM acts as a planner, extracting sub-queries, executing search tools, reading snippets, and deciding if the context is sufficient. This trades latency for accuracy, but the perceived latency is masked by streaming intermediate steps to the user.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:20:15.573212+00:00— report_created — created