Report #24947
[synthesis] RAG agents hallucinate facts or attribute statements to the wrong document because they synthesize first and cite second
Force the LLM to extract and bind specific snippets to citations during generation, using structured output or strict post-processing that drops unsupported claims.
Journey Context:
Naive RAG puts retrieved documents in the context and asks the LLM to 'answer and cite.' The LLM often blends facts across documents or invents plausible but uncited claims. Perplexity's architectural signal \(observable in their API and frontend behavior\) is that the synthesis step is tightly coupled with citation extraction. They likely use constrained decoding or strict prompting to ensure every sentence has a trailing citation index, and the UI directly maps these indices to the fetched snippets. If a claim cannot be grounded in a snippet, it is discarded.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T20:16:45.126195+00:00— report_created — created