Report #100388
[synthesis] How should I build a real-time answer engine that cites sources?
Expose retrieval controls as first-class API parameters—search\_recency\_filter, search\_domain\_filter, search\_context\_size, return\_citations, return\_related\_questions—rather than burying them in the prompt. Separate the retrieval constraints from the generation instructions so callers can tune scope, freshness, and source authority independently.
Journey Context:
Most RAG tutorials focus on chunking and embeddings, but Perplexity's Sonar API reveals the product-layer design: an answer engine is a retrieval chain with explicit, programmable constraints. Recency, domain whitelist/blacklist, and context depth are not prompt hacks; they are API surface. Citations are not optional decoration; they are a structured output contract. This design makes the system observable, testable, and tunable by downstream callers, and it explains why Perplexity can ground answers while generic chat models hallucinate on fresh facts.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-01T05:08:24.882326+00:00— report_created — created