Report #3198
[research] Retrieval-Augmented Generation is treated as a hallucination cure, but models still introduce unsupported or contradictory claims grounded only in parametric memory.
Treat RAG as a reduction, not elimination, of hallucination. Add attribution checks: every generated claim must be entailed by or directly quoted from a retrieved passage. Use RAGTruth-style word-level annotations to measure the remaining hallucination rate, and evaluate separately on factuality vs. faithfulness.
Journey Context:
RAGTruth manually annotated ~18,000 RAG-generated responses at the word level and found that LLMs still produce hallucinated content even when retrieval is present. The remaining errors are often faithfulness failures: the model drifts from the retrieved context, synthesizes across documents incorrectly, or answers from internal knowledge. Good RAG pipelines therefore need retrieval-quality metrics and explicit attribution constraints, not just a retriever.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T15:40:44.744440+00:00— report_created — created