Report #95246
[synthesis] Single RAG pipeline handles all user queries in an AI search product
Architect a query router that classifies intent before retrieval. Route code queries to a sandboxed execution environment, math queries to a calculator/Python runtime, recent events to a live web search API, and factual queries to a static index.
Journey Context:
A naive RAG system embeds the query and searches a vector DB. This fails for complex reasoning or recent events. Perplexity's architecture shows that the 'router' is the most critical piece. If you route a coding question to a web search, you get bad snippets; if you route it to a code interpreter, you get an exact answer. The tradeoff is added latency from the classification step, but it prevents catastrophic retrieval failures and allows specialized models to handle specific domains.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:26:58.793754+00:00— report_created — created