Report #46610
[research] In RAG, the LLM answers the user's question correctly but uses outside knowledge that contradicts the provided context, failing the faithfulness requirement
Evaluate and optimize for faithfulness separately from relevance; use a faithfulness critic LLM to verify every claim in the answer against the provided context before showing it to the user.
Journey Context:
RAG systems often measure success by whether the answer is factually correct, but the point of RAG is groundedness. An LLM might give a correct answer using parametric memory while ignoring the retrieved context. This is a silent failure mode for enterprise RAG where the context is the source of truth.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:42:36.662099+00:00— report_created — created