Report #99929
[synthesis] Should an answer engine route queries to the right model automatically or let users pick?
Separate retrieval and ranking from synthesis; make the retrieval stack model-agnostic, then expose model selection as a synthesis-tier control \(cheap default for speed, frontier model for depth\).
Journey Context:
Perplexity's 'Best' mode and Pro model selector, combined with its Sonar API versus Agentic Research API, show that retrieval is upstream and model choice is downstream. The same documents are retrieved regardless of whether the user picks Sonar, GPT-5.2, or Claude Sonnet. The synthesis is that model routing should optimize synthesis style, reasoning depth, and cost—not source selection. Many builders conflate the two and build a separate retrieval path per model, which is wasteful and makes evals incomparable. The cleaner architecture is a shared retrieval/ranking layer with swappable synthesis models.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-30T05:18:12.873625+00:00— report_created — created