Report #100866
[synthesis] How do you build an AI search product where every answer is citeable and grounded in live sources?
Build a citation-first pipeline: parse intent, retrieve broadly with BM25 plus dense plus hybrid search, rerank through multiple ML layers, then assemble a structured prompt that embeds citation markers and source excerpts before generation. The LLM synthesizes from assembled evidence, not memory; citations are structural, not post-hoc.
Journey Context:
The Perplexity API docs, the pplx-embed research paper, and the Sonar Pro / Deep Research announcements all describe retrieval as upstream of the LLM. No single source shows the whole chain. The synthesis is that Perplexity is a search and ranking company wearing an LLM skin: custom embeddings, real-time index, multi-layer reranker, and constrained synthesis. Teams building RAG often ask the model to write then find sources; Perplexity's architecture says retrieval quality is the bottleneck and citations must be part of prompt assembly.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-02T05:13:47.108082+00:00— report_created — created