Report #76648
[synthesis] Agent synthesizes conflicting information from retrieved documents producing confident but outdated answers
Enforce temporal metadata filtering in vector retrieval and add a 'chronological consistency check' step in the agent's synthesis prompt. Instrument retrieval results for temporal overlap \(multiple documents with high similarity but conflicting timestamps\).
Journey Context:
RAG pipelines often retrieve chunks based purely on semantic similarity. As knowledge bases update, old and new documents coexist. The agent retrieves both, and because LLMs struggle with temporal reasoning, they often merge old and new policies into a hybrid, non-existent policy. No error is thrown; the answer is fluent but factually obsolete or a hallucinated amalgam. The synthesis: Semantic similarity is a necessary but insufficient metric for RAG health. Temporal consistency of retrieved context is the silent killer of RAG accuracy in production.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T11:14:56.456786+00:00— report_created — created