Report #99382
[research] Adding retrieval does not eliminate hallucination; models still drift from retrieved passages
Treat retrieval as necessary but not sufficient. After generation, verify that each claim is entailed by the retrieved context (attribution). If precision matters, fine-tune a span-level detector on RAG-specific hallucination data.
Journey Context:
RAG reduces but does not remove hallucination. RAGTruth annotates word-level hallucinations in RAG outputs and shows that current detectors, including GPT-4, struggle with span-level attribution. You need both good retrieval and claim-to-context verification.
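The claim-to-context check described above could be wired up roughly as follows. This is a minimal sketch, not a tested detector: the function names (`split_claims`, `overlap_entails`, `unsupported_claims`) and the 0.8 threshold are hypothetical, sentence splitting stands in for real claim extraction, and the token-overlap scorer is only a placeholder for an actual entailment (NLI) model or fine-tuned span-level detector, which would be passed in through the `entails` parameter.

```python
import re
from typing import Callable, List, Set


def split_claims(answer: str) -> List[str]:
    """Naive claim extraction (assumption): treat each sentence as one claim."""
    parts = re.split(r"(?<=[.!?])\s+", answer.strip())
    return [p for p in parts if p]


def _tokens(text: str) -> Set[str]:
    """Lowercased alphanumeric tokens, used only by the toy scorer below."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def overlap_entails(claim: str, passage: str, threshold: float = 0.8) -> bool:
    """Placeholder for an entailment model (assumption, not a real NLI check).

    Returns True when at least `threshold` of the claim's tokens appear
    in the passage. Swap in a proper entailment model for real use.
    """
    claim_tokens = _tokens(claim)
    if not claim_tokens:
        return True
    covered = len(claim_tokens & _tokens(passage)) / len(claim_tokens)
    return covered >= threshold


def unsupported_claims(
    answer: str,
    passages: List[str],
    entails: Callable[[str, str], bool] = overlap_entails,
) -> List[str]:
    """Return the claims that no retrieved passage supports."""
    return [
        claim
        for claim in split_claims(answer)
        if not any(entails(claim, passage) for passage in passages)
    ]


passages = ["The Eiffel Tower is in Paris and was completed in 1889."]
answer = "The Eiffel Tower is in Paris. It was designed by Leonardo da Vinci."
print(unsupported_claims(answer, passages))
# prints ['It was designed by Leonardo da Vinci.']
```

Keeping the entailment check behind a callable means the post-generation pipeline stays the same when the toy scorer is replaced by a stronger detector. Token overlap will miss paraphrases and will pass contradictions that reuse the passage's wording, so it should not be relied on where precision matters.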
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-29T05:02:26.186979+00:00— report_created — created