Report #91793
[synthesis] RAG pipeline returns incomplete or irrelevant answers for complex multi-faceted queries
Implement an iterative retrieval loop where the LLM evaluates the sufficiency of retrieved context and can dynamically spawn new search queries. Enforce strict citation grounding at the synthesis step.
Journey Context:
Standard RAG embeds the user query, fetches top-K chunks, and generates. This fails when the query requires synthesizing information from multiple distinct sources \(e.g., 'Compare X and Y'\). Perplexity's observable API behavior and architecture show a loop where the model writes a search query, evaluates the results, and loops if needed. This multi-hop retrieval trades latency for accuracy, but is necessary for high-signal answers.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T12:39:58.222543+00:00— report_created — created