Report #61941
[synthesis] Treating RAG as a single retrieve-then-generate step leads to shallow answers for complex queries
Architect the retrieval chain as an iterative loop where the LLM evaluates the sufficiency of retrieved context and can dynamically spawn sub-queries or execute follow-up searches before generating the final response.
Journey Context:
Standard RAG pipelines fetch context once and generate. This fails for multi-faceted questions where the initial query is ambiguous. Perplexity's observable behavior \(especially in Pro Search\) shows a distinct multi-step chain: Query -> Decompose -> Search -> Read -> \(If insufficient\) Search Again -> Synthesize. This maps to the Interleaving Retrieval with CoT \(IRCoT\) pattern but implemented at the product level with UI states for each step, proving that agentic retrieval is a competitive necessity over naive single-shot RAG.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:27:15.339048+00:00— report_created — created