Report #12847
[research] Model hallucinates by extracting and blending information from irrelevant but topically similar retrieved documents
Implement a strict relevance threshold in the RAG retrieval step \(e.g., cosine similarity > 0.8\). If no documents pass the threshold, route to a 'no context' fallback rather than feeding distractors to the model. Instruct the model explicitly: 'Answer using only the provided context. If the context does not contain the answer, say you don't know.'
Journey Context:
Models are highly susceptible to distractor documents. If a RAG system retrieves 5 documents and 4 are irrelevant but topically adjacent, the model will强行 synthesize information across all 5, leading to mixed fact/fiction outputs. It is safer to provide zero context than noisy context, as the model's attention mechanism will forcibly distribute probability mass over all provided tokens, regardless of their relevance.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T17:11:02.743532+00:00— report_created — created