Report #94466
[synthesis] Why does single-shot RAG fail for complex multi-hop questions in production AI search?
Implement an iterative retrieval loop where the LLM evaluates the sufficiency of search results and spawns follow-up queries before synthesis.
Journey Context:
Standard RAG does embed -> search -> generate. This fails when the answer requires finding A to know what B to search for. Perplexity's Pro Search \(observable via network requests and UI steps\) explicitly runs a multi-step agent loop: Query -> Search -> Extract -> Evaluate -> Query again -> Synthesize. This trades latency for accuracy on complex queries, a pattern now standard in high-signal retrieval products but absent in basic RAG tutorials.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T17:08:48.236698+00:00— report_created — created